Title :
Hands-free speech recognition based on 3-D Viterbi search using a microphone array
Author :
Yamada, Takeshi ; Nakamura, Satoshi ; Shikano, Kiyohiro
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma, Japan
Abstract :
A microphone array is a promising solution for realizing hands-free speech recognition in real environments. Accurate talker localization is very important for speech recognition using a microphone array. However localization of a moving talker is difficult in noisy reverberant environments. Talker localization errors degrade the performance of speech recognition. To solve the problem, this paper proposes a new speech recognition algorithm which considers multiple talker direction hypotheses simultaneously. The proposed algorithm performs a Viterbi search in 3-dimensional trellis space composed of talker directions, input frames, and HMM states. As a result, a locus of the talker and a phoneme sequence of the speech are obtained by finding an optimal path with the highest likelihood. To evaluate the performance of the proposed algorithm, speech recognition experiments are carried out on simulated data and real environment data. These results show that the proposed algorithm works well even if the talker moves
Keywords :
acoustic transducer arrays; array signal processing; direction-of-arrival estimation; hidden Markov models; maximum likelihood estimation; microphones; search problems; speech recognition; 3-D Viterbi search; 3-dimensional trellis space; HMM states; hands-free speech recognition; highest likelihood; input frames; localization errors; microphone array; moving talker; multiple talker direction hypotheses; noisy reverberant environments; optimal path; performance; phoneme sequence; talker localization; Acoustic noise; Degradation; Hidden Markov models; Information science; Microphone arrays; Speech analysis; Speech enhancement; Speech recognition; Viterbi algorithm; Working environment noise;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.674413