Title :
Experiments on acoustic model supervised adaptation and evaluation by K-Fold Cross Validation technique
Author :
Caon, Daniel R S ; Amehraye, Asmaa ; Razik, Joseph ; Chollet, Gérard ; Andre, Rodrigo V. ; Mokbel, Chafic
Author_Institution :
Inst. Telecom, Telecom ParisTech, Paris, France
fDate :
Sept. 30 2010-Oct. 2 2010
Abstract :
This paper is an analysis of adaptation techniques for French acoustic models (hidden Markov models). The LVCSR engine Julius, the Hidden Markov Model Toolkit (HTK) and the K-Fold CV technique are used together to build three different adaptation methods: Maximum Likelihood a priori (ML), Maximum Likelihood Linear Regression (MLLR) and Maximum a posteriori (MAP). Experimental results by means of word and phoneme error rate indicate that the best adaptation method depends on the adaptation data, and that the acoustic models performance can be improved by the use of alignments at phoneme-level and K-Fold Cross Validation (CV). The very known K-Fold CV technique will point to the best adaptation technique to follow considering each case of data type.
Keywords :
hidden Markov models; maximum likelihood estimation; speech recognition; K-fold cross validation technique; acoustic model supervised adaptation; hidden Markov model toolkit; maximum a posteriori; maximum likelihood linear regression; speech recognition; Acoustics; Adaptation model; Data models; Databases; Electronic mail; Hidden Markov models; Speech; k-fold cross validation; maximum a priori; maximum likelihood linear re-gression; speech recognition;
Conference_Titel :
I/V Communications and Mobile Network (ISVC), 2010 5th International Symposium on
Conference_Location :
Rabat
Print_ISBN :
978-1-4244-5996-4
DOI :
10.1109/ISVC.2010.5656264