DocumentCode :
921742
Title :
On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition
Author :
Huang, Xuedong ; Lee, Kai-Fu
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
1
Issue :
2
fYear :
1993
fDate :
4/1/1993 12:00:00 AM
Firstpage :
150
Lastpage :
157
Abstract :
The DARPA Resource Management task is used as a domain for investigating the performance of speaker-independent, speaker-dependent, and speaker-adaptive speech recognition. The error rate of the speaker-independent recognition system, SPHINX, was reduced substantially by incorporating between-word triphone models additional dynamic features, and sex-dependent, semicontinuous hidden Markov models. The error rate for speaker-independent recognition was 4.3%. On speaker-dependent data, the error rate was further reduced to 2.6-1.4% with 600-2400 training sentences for each speaker. Using speaker-independent models, the authors studied speaker-adaptive recognition. Both codebooks and output distributions were considered for adaptation. It was found that speaker-adaptive systems outperform both speaker-independent and speaker-dependent systems, suggesting that the most effective system is one that begins with speaker-independent training and continues to adapt to users
Keywords :
hidden Markov models; speech recognition; DARPA Resource Management task; HMM; SPHINX; between-word triphone models; codebooks; dynamic features; error rate; hidden Markov models; output distributions; speaker dependent recognition; speaker-adaptive recognition; speaker-independent recognition; speech recognition; Computer science; Databases; Error analysis; Hidden Markov models; Resource management; Speech recognition; Testing; Training data; US Department of Defense; Vocabulary;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.222875
Filename :
222875
Link To Document :
بازگشت