DocumentCode :
1377026
Title :
Zero-crossing-based speech segregation and recognition for humanoid robots
Author :
An, Sung Jun ; Kil, Rhee Man ; Kim, Young-Ik
Author_Institution :
Dept. of Math. Sci., Korea Adv. Inst. of Sci. & Technol. (KAIST), Daejeon, South Korea
Volume :
55
Issue :
4
fYear :
2009
fDate :
11/1/2009
Firstpage :
2341
Lastpage :
2348
Abstract :
Humanoid robots attract attention because their overall appearance resembles the human body, which allows them to interact with humans and the surrounding environment. For auditory interaction with humans, it is desirable that humanoid robots have a capacity similar to that of the human auditory information processing system. This is a very difficult task, since current automatic speech recognition (ASR) systems are not robust to noise and it is hard to attend to a selected speech source. In this context, this paper presents a new method of zero-crossing-based binaural mask estimation for speech segregation and recognition when multiple sound sources are present simultaneously. The proposed method provides high speech segregation and recognition performance while offering significantly lower computational complexity than conventional methods based on cross-correlation. We expect that this method can provide an effective tool for auditory interaction with humanoid robots using the sensory information of binaural sounds.
Keywords :
estimation theory; humanoid robots; speech intelligibility; speech recognition; auditory interaction; binaural mask estimation; zero-crossing-based speech segregation; acoustic noise; automatic speech recognition; humans; information processing; noise robustness; speech coding; speech enhancement; working environment noise; zero-crossings; sound source localization; speech segregation
fLanguage :
English
Journal_Title :
Consumer Electronics, IEEE Transactions on
Publisher :
IEEE
ISSN :
0098-3063
Type :
jour
DOI :
10.1109/TCE.2009.5373808
Filename :
5373808