A Robust Speech Communication into Smart Info-Media System

Files in This Item:
e96-a_11_2074.pdf (1.44 MB, PDF)
Please use this identifier to cite or link to this item: http://hdl.handle.net/2115/54803

Title: A Robust Speech Communication into Smart Info-Media System
Authors: Miyanaga, Yoshikazu
Takahashi, Wataru
Yoshizawa, Shingo
Keywords: smart info-media system
robust speech recognition
voice activity detection
speech rejection
ASIC
low power consumption design
Issue Date: Nov-2013
Publisher: Institute of Electronics, Information and Communication Engineers
Journal Title: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Volume: E96-A
Issue: 11
Start Page: 2074
End Page: 2080
Publisher DOI: 10.1587/transfun.E96.A.2074
Abstract: This paper introduces the noise-robust speech communication techniques we have developed and describes their implementation in a smart info-media system, namely a small robot. The speech communication system consists of automatic speech detection, recognition, and rejection. With automatic speech detection and recognition, an observed speech waveform can be recognized without a manual trigger. With speech rejection, the system accepts only registered speech phrases and rejects all other words. In other words, although an arbitrary input speech waveform can be fed into the system and recognized, the system responds only to the registered phrases. The noise-robust speech processing reduces various kinds of noise in many environments. In addition to the design of noise-robust speech recognition, the paper presents the LSI design of the system. By implementing speech recognition as an application-specific IC (ASIC), low power consumption and real-time processing are realized simultaneously. The paper describes the LSI architecture of the system and its performance in several field experiments. The current speech recognition accuracy is 85-99% under 0-20 dB SNR and echo conditions.
Rights: Copyright © 2013 The Institute of Electronics, Information and Communication Engineers
Relation: http://search.ieice.org/
Type: article
URI: http://hdl.handle.net/2115/54803
Appears in Collections: Graduate School of Information Science and Technology / Faculty of Information Science and Technology > Peer-reviewed Journal Articles, etc

Submitter: Miyanaga, Yoshikazu (宮永 喜一)
