Title :
Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance
Author :
Kitaoka, Norihide ; Yamamoto, Kazumasa ; Kusamizu, Tomohiro ; Nakagawa, Seiichi ; Yamada, Takeshi ; Tsuge, Satoru ; Miyajima, Chiyomi ; Nishiura, Takanobu ; Nakayama, Masato ; Denda, Yuki ; Fujimoto, Masakiyo ; Takiguchi, Tetsuya ; Tamura, Satoshi ; Kuroi
Author_Institution :
Nagoya Univ., Nagoya
Abstract :
Voice activity detection (VAD) plays an important role in speech processing including speech recognition, speech enhancement, and speech coding in noisy environments. We developed an evaluation framework for VAD in such environments, called corpus and environment for noisy speech recognition 1 concatenated (CENSREC-1-C). This framework consists of noisy continuous digit utterances and evaluation tools for VAD results. By adoptiong two evaluation measures, one for frame-level detection performance and the other for utterance-level detection performance, we provide the evaluation results of a power-based VAD method as a baseline. When using VAD in speech recognizer, the detected speech segments are extended to avoid the loss of speech frames and the pause segments are then absorbed by a pause model. We investigate the balance of an explicit segmentation by VAD and an implicit segmentation by a pause model using an experimental simulation of segment extension and show that a small extension improves speech recognition.
Keywords :
signal detection; speech processing; speech recognition; continuous digit utterance; frame-level detection; noisy speech recognition; speech coding; speech enhancement; speech processing; utterance-level detection; voice activity detection; Acoustic noise; Additive noise; Concatenated codes; Databases; Speech coding; Speech enhancement; Speech processing; Speech recognition; Statistical analysis; Working environment noise; Noisy speech recognition; Voice activity detection; evaluation framework;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430182