Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >
A Robust Speech Communication into Smart Info-Media System
Title: | A Robust Speech Communication into Smart Info-Media System |
Authors: | Miyanaga, Yoshikazu Browse this author →KAKEN DB | Takahashi, Wataru Browse this author | Yoshizawa, Shingo Browse this author →KAKEN DB |
Keywords: | smart info-media system | robust speech recognition | voice activity detection | speech rejection | ASIC | low power consumption design |
Issue Date: | Nov-2013 |
Publisher: | Institute of Electronics, Information and Communication Engineers |
Journal Title: | IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences |
Volume: | E96A |
Issue: | 11 |
Start Page: | 2074 |
End Page: | 2080 |
Publisher DOI: | 10.1587/transfun.E96.A.2074 |
Abstract: | This paper introduces our developed noise robust speech communication techniques and describes its implementation to a smart info-media system, i.e., a small robot. Our designed speech communication system consists of automatic speech detection, recognition, and rejection. By using automatic speech detection and recognition, an observed speech waveform can be recognized without a manual trigger. In addition, using speech rejection, this system only accepts registered speech phrases and rejects any other words. In other words, although an arbitrary input speech waveform can be fed into this system and recognized, the system responds only to the registered speech phrases. The developed noise robust speech processing can reduce various noises in many environments. In addition to the design of noise robust speech recognition, the LSI design of this system has been introduced. By using the design of speech recognition application specific IC (ASIC), we can simultaneously realize low power consumption and real-time processing. This paper describes the LSI architecture of this system and its performances in some field experiments. In terms of current speech recognition accuracy, the system can realize 85-99% under 0-20 dB SNR and echo environments. |
Rights: | Copyright © 2013 The Institute of Electronics, Information and Communication Engineers |
Relation: | http://search.ieice.org/ |
Type: | article |
URI: | http://hdl.handle.net/2115/54803 |
Appears in Collections: | 情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)
|
Submitter: 宮永 喜一
|