
Enhancement of Esophageal Speech Using Statistical Voice Conversion

Full text
WA-P1-3.pdf (447.71 kB, PDF)
Please use the following URL to link to this item: http://hdl.handle.net/2115/39810

Title: Enhancement of Esophageal Speech Using Statistical Voice Conversion
Authors: Doi, Hironori
Nakamura, Keigo
Toda, Tomoki
Saruwatari, Hiroshi
Shikano, Kiyohiro
Issue Date: 4 October 2009
Publisher: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference, International Organizing Committee
Journal Title: Proceedings : APSIPA ASC 2009 : Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference
Start Page: 805
End Page: 808
Abstract: This paper presents a novel method of enhancing esophageal speech based on statistical voice conversion. Esophageal speech is one of the speaking methods for total laryngectomees. Although it allows laryngectomees to speak by generating a sound source and articulating it to produce audible speech sounds using their esophagus and vocal organs, the generated voices sound unnatural. To improve the naturalness of esophageal speech, we propose a voice conversion method from esophageal speech into normal speech (ES-to-Speech). A spectral parameter and excitation parameters, such as F0 and aperiodic components, of normal speech are separately estimated from the spectral parameter of the esophageal speech in the sense of maximum likelihood using different Gaussian mixture models. We conduct objective and subjective evaluations of the proposed method. The experimental results demonstrate that the proposed method yields significant improvements in naturalness of esophageal speech while maintaining its intelligibility.
Description: APSIPA ASC 2009: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference. 4-7 October 2009. Sapporo, Japan. Poster session: Speech Processing (7 October 2009).
Type: proceedings
URI: http://hdl.handle.net/2115/39810
Appears in Collections: 2009 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (2009 APSIPA Annual Summit and Conference)
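
The abstract describes estimating normal-speech parameters from esophageal-speech spectral parameters with Gaussian mixture models. As a rough illustration of GMM-based feature mapping in general, the sketch below applies a per-frame conditional-mean mapping under a joint GMM over scalar source and target features. This is a simpler stand-in, not the paper's maximum likelihood estimation, and it is not taken from the paper: the names `Component`, `convert_frame`, `convert`, and `TOY_GMM`, the scalar features, and all parameter values are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Component:
    """One mixture component of a joint GMM over (source x, target y)."""
    weight: float   # mixture weight
    mu_x: float     # mean of the source feature
    mu_y: float     # mean of the target feature
    var_x: float    # variance of the source feature
    cov_xy: float   # cross-covariance between source and target


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def posteriors(x, components):
    """Posterior probability of each component given the source feature x."""
    likes = [c.weight * gaussian_pdf(x, c.mu_x, c.var_x) for c in components]
    total = sum(likes)
    return [like / total for like in likes]


def convert_frame(x, components):
    """Conditional-mean estimate of the target feature for one frame."""
    post = posteriors(x, components)
    return sum(
        p * (c.mu_y + c.cov_xy / c.var_x * (x - c.mu_x))
        for p, c in zip(post, components)
    )


def convert(frames, components):
    """Map a sequence of source frames to target frames, one frame at a time."""
    return [convert_frame(x, components) for x in frames]


# Toy, made-up parameters (assumption for illustration only).
TOY_GMM = [
    Component(weight=0.5, mu_x=-1.0, mu_y=-2.0, var_x=1.0, cov_xy=0.5),
    Component(weight=0.5, mu_x=1.0, mu_y=2.0, var_x=1.0, cov_xy=0.5),
]

if __name__ == "__main__":
    print(convert([-1.0, 0.0, 1.0], TOY_GMM))
```

In a real system the features would be vectors, the GMM parameters would be trained on time-aligned source and target frames, and, following the abstract, separate models would be used for the spectral and excitation parameters.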

 
