Title :
The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home
Author :
Giannoulis, Panagiotis ; Tsiami, Antigoni ; Rodomagoulakis, I. ; Katsamanis, Athanasios ; Potamianos, Gerasimos ; Maragos, Petros
Author_Institution :
Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens, Athens, Greece
Abstract :
We present our system for speech activity detection and speaker localization inside a smart home with multiple rooms equipped with microphone arrays of known geometry and placement. The smart home is developed as part of the DIRHA European funded project, providing both simulated and real data for system development and evaluation, under extremely challenging conditions of noise, reverberation, and speech overlap. Our proposed approach performs speech activity detection first, by employing multi-microphone decision fusion on traditional statistical models and acoustic features, within a Viterbi decoding framework, further assisted by signal energy- and model log-likelihood threshold-based heuristics. Then it performs speaker localization using traditional time-difference of arrival estimation between properly selected microphone pairs, further assisted by a dereverberation component. The system achieves very low detection errors, namely less than 4% (5%) for speech activity detection in the simulated (real) DIRHA corpus, and less than 10% (12%) for joint speech detection and speaker localization.
Keywords :
Viterbi decoding; acoustic signal processing; home computing; microphones; speaker recognition; speech coding; Athena-RC system; DIRHA European funded project; DIRHA corpus; DIRHA smart home; Viterbi decoding framework; acoustic features; dereverberation component; joint speech detection; microphone arrays; microphone pairs; model log-likelihood threshold-based heuristics; multimicrophone decision fusion; noise; signal energy-based heuristics; speaker localization; speech activity detection; speech overlap; statistical models; system development; system evaluation; time-difference of arrival estimation; Acoustics; Direction-of-arrival estimation; Estimation; Microphone arrays; Smart homes; Speech; microphone arrays; smart homes; speaker localization; speech detection;
Conference_Titel :
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
Conference_Location :
Villers-les-Nancy
DOI :
10.1109/HSCMA.2014.6843273