Title :
A stochastic articulatory-to-acoustic mapping as a basis for speech recognition
Author :
Hodgen, J. ; Valdez, Patrick
Author_Institution :
Modeling, Algorithms & Inf. Group, Los Alamos Nat. Lab., NM, USA
Abstract :
Hidden Markov models of speech acoustics are the current state-of-the-art in speech recognition, but these models bear little resemblance to the processes underlying speech production. We describe two stochastic models of speech production: maximum likelihood continuity mapping (MALCOM) and conditional observable maximum likelihood continuity mapping (CO-MALCOM). The main component of these models is a stochastic mapping between acoustics and speech articulation. Perhaps surprisingly, the parameters of the stochastic mapping can be found using only acoustic data. We discuss theoretical and experimental reasons to believe that MALCOM and CO-MALCOM learn a stochastic mapping between articulator positions and speech acoustics. CO-MALCOM can be combined with standard recognition algorithms to do speech recognition based on a production model
Keywords :
maximum likelihood sequence estimation; speech coding; speech recognition; stochastic processes; vector quantisation; VQ code; conditional observable maximum likelihood continuity mapping; inverse mapping; maximum likelihood continuity mapping; speech articulation; speech coding; speech production model; speech recognition; stochastic articulatory-to-acoustic mapping; stochastic models; Acoustics; Automatic speech recognition; Hidden Markov models; Informatics; Laboratories; Maximum likelihood estimation; Speech processing; Speech recognition; Stochastic processes; World Wide Web;
Conference_Titel :
Instrumentation and Measurement Technology Conference, 2001. IMTC 2001. Proceedings of the 18th IEEE
Conference_Location :
Budapest
Print_ISBN :
0-7803-6646-8
DOI :
10.1109/IMTC.2001.928251