Title :
A Robust Endpoint Detection Algorithm for Video Caption Generation
Author :
Li, Qi ; Ma, Huadong ; Feng, Shuo
Author_Institution :
Beijing Key Lab. of Intell. Telecommun. Software & Multimedia, Beijing Univ. of Posts & Telecommun., Beijing
Abstract :
This paper proposes a robust endpoint detection algorithm for continuous speech in noisy environments, which can be used in automatic video caption generation systems. The algorithm integrates the widely used energy, zero-crossing, and entropy features into a new feature, the EZE-feature, which retains the advantages of each individual feature while compensating for its drawbacks. Moreover, we propose a robust endpoint detection method in which the EZE-feature adjusts its environment parameters according to the strength of the background noise. The proposed algorithm has been used in an automatic video caption generation system, where it performs well.
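The abstract does not give the EZE-feature formula or the adaptation rule, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical product-style combination of per-frame energy, zero-crossing count, and amplitude-histogram entropy, and a threshold scaled from the leading frames, which are assumed to contain background noise only. The names frame_features, combined_feature, detect_endpoints, and the parameters frame_len, noise_frames, scale, and bins are all illustrative.

```python
import math


def frame_features(frame, bins=16):
    """Return (energy, zero-crossing count, amplitude entropy) for one frame."""
    energy = sum(x * x for x in frame)
    # Count sign changes between consecutive samples.
    zero_crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    # Entropy of the amplitude histogram (an assumption; the paper may
    # define entropy differently, e.g. on the spectrum).
    lo, hi = min(frame), max(frame)
    if hi == lo:
        entropy = 0.0
    else:
        counts = [0] * bins
        for x in frame:
            idx = min(int((x - lo) / (hi - lo) * bins), bins - 1)
            counts[idx] += 1
        n = len(frame)
        entropy = -sum(c / n * math.log2(c / n) for c in counts if c)
    return energy, zero_crossings, entropy


def combined_feature(frame):
    """Hypothetical combination of the three features (not the paper's EZE)."""
    energy, zero_crossings, entropy = frame_features(frame)
    return energy * (1 + zero_crossings) * (1 + entropy)


def detect_endpoints(samples, frame_len=100, noise_frames=5, scale=3.0):
    """Return (first, last) active frame indices, or None if none is active."""
    frames = [
        samples[i:i + frame_len]
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]
    scores = [combined_feature(f) for f in frames]
    lead = scores[:noise_frames]
    if not lead:
        return None
    # Adapt the threshold to the background-noise level estimated from
    # the leading frames, which are assumed to be noise only.
    threshold = scale * (sum(lead) / len(lead))
    active = [i for i, s in enumerate(scores) if s > threshold]
    if not active:
        return None
    return active[0], active[-1]
```

Calling detect_endpoints on a list of samples returns the indices of the first and last frames whose combined score exceeds the noise-adapted threshold. A real system would likely add smoothing and minimum-duration rules, which are omitted here.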
Keywords :
entropy; speech recognition; video signal processing; EZE-feature; automatic video caption generation; continuous speech recognition system; robust endpoint detection algorithm; zero crossing feature; Acoustic noise; Background noise; Detection algorithms; Entropy; Noise generators; Noise robustness; Speaker recognition; Speech; Telecommunication computing; Working environment noise; Endpoint detection; Speech analysis; video caption generation
Conference_Title :
Young Computer Scientists, 2008. ICYCS 2008. The 9th International Conference for
Conference_Location :
Hunan
Print_ISBN :
978-0-7695-3398-8
Electronic_ISBN :
978-0-7695-3398-8
DOI :
10.1109/ICYCS.2008.71