Title :
Vocal tract length normalization strategy based on maximum likelihood criterion
Author :
Jakovljevic, Niksa M. ; Secujski, M.S. ; Delic, V.D.
Abstract :
In this paper performances of automatic speech recognition systems which use vocal tract length normalization (VTN) are presented. Beside standard procedure for VTN coefficient estimation several variants based on robust statistic methods are introduced. All systems which use VTN performed better than referent systems, while the best performance was achieved by the system in which the VTN coefficient for a particular speaker is chosen as the one with maximum sample mean of likelihoods per phoneme. Phoneme likelihoods are calculated as sample medians of feature vectors corresponding to particular phonemes. The relative improvement of performance for this system is about 20%.
Keywords :
maximum likelihood estimation; speech recognition; automatic speech recognition systems; coefficient estimation; feature vectors; maximum likelihood criterion; particular speaker; phoneme likelihoods; robust statistic methods; vocal tract length normalization strategy; Automatic speech recognition; Hidden Markov models; Loudspeakers; Materials testing; Maximum likelihood estimation; Mel frequency cepstral coefficient; Parameter estimation; Piecewise linear techniques; Robustness; Statistics; speech recognition; vocal tract length normalization;
Conference_Titel :
EUROCON 2009, EUROCON '09. IEEE
Conference_Location :
St.-Petersburg
Print_ISBN :
978-1-4244-3860-0
Electronic_ISBN :
978-1-4244-3861-7
DOI :
10.1109/EURCON.2009.5167662