Title :
On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition
Author :
Huang, Xuedong ; Lee, Kai-Fu
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fDate :
4/1/1993 12:00:00 AM
Abstract :
The DARPA Resource Management task is used as a domain for investigating the performance of speaker-independent, speaker-dependent, and speaker-adaptive speech recognition. The error rate of the speaker-independent recognition system, SPHINX, was reduced substantially by incorporating between-word triphone models additional dynamic features, and sex-dependent, semicontinuous hidden Markov models. The error rate for speaker-independent recognition was 4.3%. On speaker-dependent data, the error rate was further reduced to 2.6-1.4% with 600-2400 training sentences for each speaker. Using speaker-independent models, the authors studied speaker-adaptive recognition. Both codebooks and output distributions were considered for adaptation. It was found that speaker-adaptive systems outperform both speaker-independent and speaker-dependent systems, suggesting that the most effective system is one that begins with speaker-independent training and continues to adapt to users
Keywords :
hidden Markov models; speech recognition; DARPA Resource Management task; HMM; SPHINX; between-word triphone models; codebooks; dynamic features; error rate; hidden Markov models; output distributions; speaker dependent recognition; speaker-adaptive recognition; speaker-independent recognition; speech recognition; Computer science; Databases; Error analysis; Hidden Markov models; Resource management; Speech recognition; Testing; Training data; US Department of Defense; Vocabulary;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on