Title :
Robust speech endpoint detection based on MP3 file in various noisy environments
Author :
Wang, Fang ; Huang, Xianglin ; Yang, Lifang ; Liu, Tao
Author_Institution :
Comput. Sch., Commun. Univ. of China, Beijing
Abstract :
In audio retrieval, endpoint detection is a crucial front-end processing step. This paper presents an MP3-based approach for robust endpoint detection under different types of noise. First, the modified discrete cosine transform (MDCT) coefficients and scale factors (SCF) are extracted from the MP3 file. Then, combined with the selection of SCF, a set of MDCT coefficients of white noise is used to enhance the speech signals. Finally, a spectral entropy-based method is applied to the MDCT coefficients to obtain the proposed parameter. Experimental results show that the algorithm achieves excellent performance in various adverse environments. Furthermore, because it uses the intermediate data of MP3 decoding, the proposed method has much lower computational complexity.
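The spectral entropy step mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's proposed parameter: it omits the SCF selection and the white-noise enhancement, and it assumes the per-frame MDCT coefficients have already been extracted from the MP3 bitstream. The function names `spectral_entropy` and `detect_endpoints`, and the fixed `threshold`, are hypothetical and chosen only for illustration.

```python
import math


def spectral_entropy(coeffs):
    """Shannon entropy (in nats) of one frame's normalized energy spectrum.

    `coeffs` is assumed to hold the MDCT coefficients of a single frame.
    """
    n = len(coeffs)
    if n == 0:
        raise ValueError("frame must contain at least one coefficient")
    energies = [c * c for c in coeffs]
    total = sum(energies)
    if total == 0.0:
        # Assumption: a silent frame is treated like a flat spectrum,
        # i.e. maximum entropy, so it is never classified as speech.
        return math.log(n)
    probs = [e / total for e in energies]
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def detect_endpoints(frames, threshold):
    """Return (first, last) indices of frames whose entropy is below
    `threshold`, or None if no frame qualifies.

    A flat, noise-like spectrum has high entropy, while voiced speech
    concentrates energy in a few bands and has lower entropy.
    """
    speech = [i for i, frame in enumerate(frames)
              if spectral_entropy(frame) < threshold]
    if not speech:
        return None
    return speech[0], speech[-1]


if __name__ == "__main__":
    flat = [1.0, 1.0, 1.0, 1.0]    # noise-like frame
    peaky = [1.0, 0.0, 0.0, 0.0]   # energy concentrated in one bin
    print(detect_endpoints([flat, flat, peaky, peaky, flat], threshold=0.5))
```

A fixed threshold is the simplest possible decision rule; a practical detector would more likely adapt the threshold to the estimated noise level and smooth the decision over several frames.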
Keywords :
discrete cosine transforms; information retrieval; multimedia computing; speech enhancement; MDCT coefficients; MP3 file; audio retrieval; modified discrete cosine transform; noisy environments; robust speech endpoint detection; scale factors; spectral entropy; speech signal enhancement; white noise; Acoustic noise; Background noise; Digital audio players; Entropy; Frequency; Psychoacoustic models; Robustness; Speech enhancement; White noise; Working environment noise;
Conference_Title :
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1723-0
Electronic_ISBN :
978-1-4244-1724-7
DOI :
10.1109/ICALIP.2008.4590108