HUSCAP logo Hokkaido Univ. logo

Hokkaido University Collection of Scholarly and Academic Papers >
Hokkaido University Sustainability Weeks >
Sustainability Weeks 2009 >
2009 APSIPA Annual Summit and Conference >

Speaker segmentation based on between-window correlation over speakers' characteristics

Files in This Item:
WA-P1-6.pdf166.27 kBPDFView/Open
Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/39813

Title: Speaker segmentation based on between-window correlation over speakers' characteristics
Authors: Wang, Gang Browse this author
Zheng, Thomas Fang Browse this author
Issue Date: 4-Oct-2009
Publisher: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference, International Organizing Committee
Journal Title: Proceedings : APSIPA ASC 2009 : Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference
Start Page: 817
End Page: 820
Abstract: Speaker segmentation is widely applied in many domains such as multi-speaker detection and speaker tracking. However, the performance of the conventional metric-based methods is neither good enough nor stable due to the stability of the between-window distance calculation. In order to enhance the stability and hence to improve the performance, a new method based on the between-window correlation over speakers' characteristics is proposed. In this method, a set of reference speaker models are trained which can represent the whole speaker model space. The between-window correlation of likelihood vectors of scores against these reference models is taken as the metric. The gender information and the Peak and Valley information are also used. Experiments over NIST SRE 2002 Segmentation BNEWS and SWBD Datasets show that better performance can be achieved compared with the BIC and the GLR methods. What's more, the proposed method can achieve approximately the best performance in a wider value range of predefined thresholds than the BIC and the GLR methods, which reduces the threshold sensitivity.
Description: APSIPA ASC 2009: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference. 4-7 October 2009. Sapporo, Japan. Poster session: Speech Processing (7 October 2009).
Conference Name: APSIPA ASC 2009: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference
2009年アジア太平洋信号情報処理連合学会アニュアルサミット・国際会議
Conference Place: Sapporo
Type: proceedings
URI: http://hdl.handle.net/2115/39813
Appears in Collections:北海道大学サステナビリティ・ウィーク2009 (Sustainability Weeks 2009) > 2009年アジア太平洋信号情報処理連合学会アニュアルサミット・国際会議 (2009 APSIPA Annual Summit and Conference)

Export metadata:

OAI-PMH ( junii2 , jpcoar_1.0 )

MathJax is now OFF:


 

 - Hokkaido University