DocumentCode :
1690636
Title :
Joint analysis of vocal tract length and temporal information for robust speech recognition
Author :
Chien-Lin Huang ; Hori, Chiori ; Kashioka, Hideki ; Bin Ma
Author_Institution :
Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
fYear :
2013
Firstpage :
7432
Lastpage :
7436
Abstract :
This paper presents a joint analysis approach to address the acoustic feature normalization for robust speech recognition. The variations in acoustic environments and speakers are the major challenge for speech recognition. The conventional normalizations of these two variations are separately processed, applying the speaker normalization with an assumption of a noise free condition and applying the noise compensation with an assumption of speaker independency, and thus resulting in a suboptimal performance. The proposed joint analysis approach simultaneously considers the vocal tract length normalization and averaged temporal information of cepstral features. In a data-driven manner, the Gaussian mixture model is used to estimate the conditional parameters in the joint analysis. Experimental results show that the proposed approach achieves a substantial improvement.
Keywords :
Gaussian processes; acoustic signal processing; cepstral analysis; feature extraction; parameter estimation; signal denoising; speaker recognition; Gaussian mixture model; acoustic environments; acoustic feature normalization; averaged temporal information; cepstral features; conditional parameters estimation; noise compensation; noise free condition; robust speech recognition; speaker independency; speaker normalization; suboptimal performance; vocal tract length normalization; Cepstral analysis; Joints; Noise; Speech; Speech recognition; Training; Joint analysis; feature normalization; speech recognition; vocal tract length normalization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639107
Filename :
6639107
Link To Document :
بازگشت