Discriminative Training Techniques for Acoustic Language Identification

Author

L. Burget;P. Matejka;J. Cernocky

Author_Institution

Speech@FIT group, Brno University of Technology, Czech Republic. burget@fit.vutbr.cz

Volume

fYear

2006

fDate

6/28/1905 12:00:00 AM

Abstract

This paper presents comparison of maximum likelihood (ML) and discriminative maximum mutual information (MMI) training for acoustic modeling in language identification (LID). Both approaches are compared on state-of-the-art shifted delta-cepstra features, the results are reported on data from NIST 2003 evaluations. Clear advantage of MMI over ML training is shown. Further improvements of acoustic LID are discussed: heteroscedastic linear discriminant analysis (HLDA) for feature de-correlation and dimensionality reduction and ergodic hidden Markov models (EHMM) for better modeling of dynamics in the acoustic space. The final error rate compares favorably to other results published on NIST 2003 data

Keywords

"Natural languages","Hidden Markov models","Cepstral analysis","NIST","Linear discriminant analysis","Speech processing","Speech recognition","Feature extraction","Mel frequency cepstral coefficient","Mutual information"

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

1-4244-0469-X

Electronic_ISBN

2379-190X

Type

conf

DOI

10.1109/ICASSP.2006.1659994

Filename

1659994

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3622343