A Robust Speech Communication into Smart Info-Media System

Miyanaga, Yoshikazu; Takahashi, Wataru; Yoshizawa, Shingo

doi:10.1587/transfun.E96.A.2074


Hokkaido University \| Library \| HUSCAP	Advanced Search		言語

	Home
	About HUSCAP
	Open Access Policy

	Browse by Author

Browse
	Communities & Collections

	Scholarly Journals
	Theses
	Doctoral Dissertations Listed by Graduate Schools
	Conference Procs.
	Events

	HUSCAP Senior (in Japanese)

	Societies

	Downloads (country)

For university staff
	How to post your papers to HUSCAP
	Publication of theses
	Helpline about theses publication

Open Archives Compliant

You can search our collection also at:
	Google
	Google Scholar
	CiNii
	IRDB
	OAIster
	NDLTD

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

A Robust Speech Communication into Smart Info-Media System

Files in This Item:

e96-a_11_2074.pdf

1.44 MB

PDF

View/Open

Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/54803

Title:	A Robust Speech Communication into Smart Info-Media System
Authors:	Miyanaga, Yoshikazu Browse this author →KAKEN DB
	Takahashi, Wataru Browse this author
	Yoshizawa, Shingo Browse this author →KAKEN DB
Keywords:	smart info-media system
	robust speech recognition
	voice activity detection
	speech rejection
	ASIC
	low power consumption design
Issue Date:	Nov-2013
Publisher:	Institute of Electronics, Information and Communication Engineers
Journal Title:	IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences
Volume:	E96A
Issue:	11
Start Page:	2074
End Page:	2080
Publisher DOI:	10.1587/transfun.E96.A.2074
Abstract:	This paper introduces our developed noise robust speech communication techniques and describes its implementation to a smart info-media system, i.e., a small robot. Our designed speech communication system consists of automatic speech detection, recognition, and rejection. By using automatic speech detection and recognition, an observed speech waveform can be recognized without a manual trigger. In addition, using speech rejection, this system only accepts registered speech phrases and rejects any other words. In other words, although an arbitrary input speech waveform can be fed into this system and recognized, the system responds only to the registered speech phrases. The developed noise robust speech processing can reduce various noises in many environments. In addition to the design of noise robust speech recognition, the LSI design of this system has been introduced. By using the design of speech recognition application specific IC (ASIC), we can simultaneously realize low power consumption and real-time processing. This paper describes the LSI architecture of this system and its performances in some field experiments. In terms of current speech recognition accuracy, the system can realize 85-99% under 0-20 dB SNR and echo environments.
Rights:	Copyright © 2013 The Institute of Electronics, Information and Communication Engineers
Relation:	http://search.ieice.org/
Type:	article
URI:	http://hdl.handle.net/2115/54803
Appears in Collections:	情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

Submitter: 宮永喜一

OAI-PMH ( junii2 , jpcoar_1.0 )

- Hokkaido University