Title :
Computational auditory scene analysis and its application to robot audition
Author :
Okuno, Hiroshi G. ; Ogata, Tetsuya ; Komatani, Kazunori ; Nakadai, Kazuhiro
Author_Institution :
Graduate Sch. of Informatics, Kyoto Univ., Japan
Abstract :
We are engaged in research on computational auditory scene analysis to attain sophisticated robot (computer) human interaction by recognizing auditory awareness. The objective of our research is the understanding of an arbitrary sound mixture including nonspeech sounds and music as well as voiced speech, obtained by robot´s ears (or microphones embedded in the robot). The main issues are sound source localization, separation, and recognition at signal processing levels, and signal-to-symbol transformation at the interface level to symbol processing levels. The latter is critical in developmental communication and we are developing an automatic onomatopoeia recognition system. This paper overviews our activities in robot audition, in particular, active direction-pass filter (ADPF) that separates sounds originating from a specific direction by integrating sound source localization and visual processing. ADPF is implemented on three kinds of robots and demonstrates separating and recognizing three simultaneous speeches with a pair of microphones.
Keywords :
active filters; hearing; human computer interaction; robots; source separation; speech recognition; active direction-pass filter; automatic onomatopoeia recognition system; computational auditory scene analysis; microphones; robot audition; robot human interaction; sound source localization; visual processing; Acoustic signal processing; Application software; Ear; Human robot interaction; Image analysis; Microphones; Music; Robotics and automation; Signal processing; Speech;
Conference_Titel :
Informatics Research for Development of Knowledge Society Infrastructure, 2004. ICKS 2004. International Conference on
Print_ISBN :
0-7695-2150-9
DOI :
10.1109/ICKS.2004.1313411