DocumentCode :
2930097
Title :
On the effects of speech rate in large vocabulary speech recognition systems
Author :
Siegler, Matthew A. ; Stern, Richard M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
612
Abstract :
It is well known that a higher-than-normal speech rate will cause the rate of recognition errors in large vocabulary automatic speech recognition (ASR) systems to increase. In this paper we attempt to identify and correct for errors due to fast speech. We first suggest that phone rate is a more meaningful measure of speech rate than the more common word rate. We find that when data sets are clustered according to the phone rate metric, recognition errors increase when the phone rate is more than one standard deviation greater than the mean. We propose three methods to improve the recognition accuracy of fast speech, each addressing a different aspect of performance degradation. The first method is an implementation of Baum-Welch codebook adaptation. The second method is based on the adaptation of HMM state-transition probabilities. In the third method, the pronunciation dictionaries are modified using rule-based techniques and compound words are added. We compare improvements in recognition accuracy for each method using data sets clustered according to the phone rate metric. Adaptation of the HMM state-transition probabilities to fast speech improves recognition of fast speech by a relative amount of 4 to 6 percent.
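The clustering criterion mentioned in the abstract (phone rate more than one standard deviation above the mean) can be illustrated with a minimal sketch. The record does not give the authors' actual procedure, so everything here is an assumption made for illustration: the function names `phone_rate` and `flag_fast`, the definition of phone rate as phones per second of utterance duration, and the use of the population standard deviation.

```python
from statistics import mean, pstdev


def phone_rate(num_phones, duration_s):
    """Phones per second for one utterance (assumed definition)."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    return num_phones / duration_s


def flag_fast(rates, k=1.0):
    """Mark each rate that exceeds mean + k * standard deviation.

    k=1.0 mirrors the one-standard-deviation threshold in the abstract.
    The population standard deviation is an assumption of this sketch.
    """
    mu = mean(rates)
    sigma = pstdev(rates)
    threshold = mu + k * sigma
    return [r > threshold for r in rates]


if __name__ == "__main__":
    # Hypothetical per-utterance phone rates, not data from the paper.
    rates = [10.0, 10.0, 10.0, 10.0, 20.0]
    print(flag_fast(rates))
```

With these hypothetical rates, only the last utterance is flagged as fast. In practice the phone counts and durations would come from a forced alignment or recognizer output, which is outside the scope of this sketch.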
Keywords :
error analysis; error correction; hidden Markov models; knowledge based systems; probability; speech coding; speech recognition; Baum-Welch codebook adaptation; HMM state-transition probabilities; data sets; fast speech; large vocabulary speech recognition systems; performance degradation; phone rate; pronunciation dictionaries; recognition errors; rule-based techniques; speech rate; word rate; Automatic speech recognition; Computer errors; Computer science; Error analysis; Error correction; Hidden Markov models; Natural languages; Speech analysis; Speech processing; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479672
Filename :
479672