DocumentCode
3431984
Title
A study of applying subspace based pronunciation modeling in verifying pronunciation accuracy
Author
Yin, Shou-Chun ; Rose, Richard ; Tang, Yun
Author_Institution
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
fYear
2012
fDate
2-5 July 2012
Firstpage
59
Lastpage
64
Abstract
This paper investigates a new approach for detecting phoneme level mispronunciations from utterances obtained from impaired children with neuromuscular disorders. This new pronunciation verification (PV) approach is obtained from the subspace based Gaussian mixture model (SGMM) based pronunciation model, where a set of state level projection vectors is applied for representing phonetic variability. SGMM models are trained from disabled speakers´ utterances and PV scores are computed directly from distances between disabled and reference speaker projection vectors. An experimental study was performed to evaluate the performance of the SGMM based approach with respect to an approach based on the lattice posterior probabilities. A reduction in equal error rate (EER) of approximately 15% was obtained when the SGMM based scores were combined with lattice posterior probabilities.
Keywords
Gaussian processes; handicapped aids; speech recognition; EER; PV approach; SGMM; equal error rate; impaired children; neuromuscular disorders; phoneme level mispronunciations; pronunciation accuracy; pronunciation verification; state level projection vectors; subspace based Gaussian mixture model; subspace based pronunciation modeling; utterance; Equations; Hidden Markov models; Mathematical model; Medical treatment; Speech; Statistics; Vectors; automated speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location
Montreal, QC
Print_ISBN
978-1-4673-0381-1
Electronic_ISBN
978-1-4673-0380-4
Type
conf
DOI
10.1109/ISSPA.2012.6310622
Filename
6310622
Link To Document