Title :
Effects of discriminative training on the RACAD corpus of the French language spoken in the Canadian province of New-Brunswick
Author :
Benahmed, Yacine ; Selouani, Sid-Ahmed ; O´Shaughnessy, Douglas
Author_Institution :
INRS, EMT, Montréal, QC, Canada
Abstract :
This paper presents a recognition engine especially tailored to the French language spoken in the Canadian pro-vince of New-Brunswick. It studies a global monophone model that handles the linguistic variability found in the province. The study also explores the impact of speaker locality on recognition rate when using the global model. Three models are implemented for each linguistic poles; North-East, North-West, and South-East. The results show respectively 83.58% and 72.66% phone and word recognition rate for Mel frequency Cepstral coefficients, energy, delta and acceleration parameters acoustic models trained discriminatively with maximum mutual information and minimum phone error criterions respectively. Finally, we observe that the general acoustic models are sufficiently generalized to perform uniformly across the three linguistic poles with an average of 82.8% phone recognition rate across the three different acoustic models.
Keywords :
natural language processing; speech recognition; Canadian province; French language; New-Brunswick; North-East model; North-West model; RACAD corpus; South-East model; acceleration parameter acoustic model; discriminative training effect; global model; global monophone model; linguistic pole; linguistic variability; mel frequency cepstral coefficient; phone recognition rate; recognition engine; speaker locality; word recognition rate; Acoustics; Engines; Hidden Markov models; Maximum likelihood estimation; Pragmatics; Speech recognition; Training;
Conference_Titel :
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-0381-1
Electronic_ISBN :
978-1-4673-0380-4
DOI :
10.1109/ISSPA.2012.6310642