HUSCAP logo Hokkaido Univ. logo

Hokkaido University Collection of Scholarly and Academic Papers >
Hokkaido University Sustainability Weeks >
Sustainability Weeks 2009 >
2009 APSIPA Annual Summit and Conference >

Selection of Reliable Likelihood Ratios for Statistical Model-Based Voice Activity Detection

Files in This Item:
TP-P1-2.pdf300.82 kBPDFView/Open
Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/39773

Title: Selection of Reliable Likelihood Ratios for Statistical Model-Based Voice Activity Detection
Authors: Kim, Younggwan Browse this author
Suh, Youngjoo Browse this author
Kim, Hoirin Browse this author
Issue Date: 4-Oct-2009
Publisher: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference, International Organizing Committee
Journal Title: Proceedings : APSIPA ASC 2009 : Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference
Start Page: 623
End Page: 626
Abstract: A statistical model-based voice activity detection (VAD) is a robust algorithm in noisy condition to detect speech region from input signal by speech and non-speech statistical model such as complex Gaussian probability density function (PDF). The decision rule used in this VAD is based on Bayes' rule and considers likelihood ratios (LRs) in whole frequency region. In this VAD, however, the Bayes' rule may cause a decision error. With the statistical model, we analyze why this problem happens and show how we can decrease the decision error by using the LRs at selected frequency bins having relatively high spectral power in each frame. The performance of this VAD is evaluated by receiver operating characteristic (ROC) curves and summarized in a table, and the results from proposed methods show better performances than those of typical statistical model-based VAD.
Description: APSIPA ASC 2009: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference. 4-7 October 2009. Sapporo, Japan. Poster session: Automatic Speech Recognition (6 October 2009).
Conference Name: APSIPA ASC 2009: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference
2009年アジア太平洋信号情報処理連合学会アニュアルサミット・国際会議
Conference Place: Sapporo
Type: proceedings
URI: http://hdl.handle.net/2115/39773
Appears in Collections:北海道大学サステナビリティ・ウィーク2009 (Sustainability Weeks 2009) > 2009年アジア太平洋信号情報処理連合学会アニュアルサミット・国際会議 (2009 APSIPA Annual Summit and Conference)

Export metadata:

OAI-PMH ( junii2 , jpcoar_1.0 )

MathJax is now OFF:


 

 - Hokkaido University