Regional Information Center for Science and Technology (RICEST) - On the effects of speech rate in large vocabulary speech recognition systems

DocumentCode :

2930097

Title :

On the effects of speech rate in large vocabulary speech recognition systems

Author :

Siegler, Matthew A. ; Stern, Richard M.

Author_Institution :

Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

fYear :

1995

fDate :

9-12 May 1995

Firstpage :

612

Abstract :

It is well known that a higher-than-normal speech rate will cause the rate of recognition errors in large vocabulary automatic speech recognition (ASR) systems to increase. In this paper we attempt to identify and correct for errors due to fast speech. We first suggest that phone rate is a more meaningful measure of speech rate than the more common word rate. We find that when data sets are clustered according to the phone rate metric, recognition errors increase when the phone rate is more than one standard deviation greater than the mean. We propose three methods to improve the recognition accuracy of fast speech, each addressing different aspects of performance degradation. The first method is an implementation of Baum-Welch codebook adaptation. The second method is based on the adaptation of HMM state-transition probabilities. In the third method, the pronunciation dictionaries are modified using rule-based techniques and compound words are added. We compare improvements in recognition accuracy for each method using data sets clustered according to the phone rate metric. Adaptation of the HMM state-transition probabilities to fast speech improves recognition of fast speech by a relative amount of 4 to 6 percent.
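
The abstract's threshold (phone rate more than one standard deviation above the mean) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say how phone rate was measured, so the sketch assumes it is phones per second, with phone counts and durations taken from some alignment step. The names `phone_rate` and `flag_fast` and the sample data are hypothetical.

```python
from statistics import mean, stdev


def phone_rate(num_phones, duration_s):
    """Phones per second for one utterance (assumed definition of phone rate)."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    return num_phones / duration_s


def flag_fast(utterances, threshold_sd=1.0):
    """Return indices of utterances whose phone rate exceeds
    mean + threshold_sd * (sample standard deviation).

    `utterances` is a list of (num_phones, duration_s) pairs.
    """
    rates = [phone_rate(n, d) for n, d in utterances]
    cutoff = mean(rates) + threshold_sd * stdev(rates)
    return [i for i, r in enumerate(rates) if r > cutoff]


# Hypothetical data: (phone count, duration in seconds) per utterance.
utterances = [(30, 3.0), (33, 3.0), (27, 3.0), (30, 3.0), (60, 3.0)]
fast = flag_fast(utterances)
```

With this sample data only the last utterance (20 phones per second against a mean of 12) falls above the cutoff, so it alone would land in the "fast speech" cluster.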

Keywords :

error analysis; error correction; hidden Markov models; knowledge based systems; probability; speech coding; speech recognition; Baum-Welch codebook adaptation; HMM state-transition probabilities; data sets; fast speech; large vocabulary speech recognition systems; performance degradation; phone rate; pronunciation dictionaries; recognition errors; rule-based techniques; speech rate; word rate; Automatic speech recognition; Computer errors; Computer science; Error analysis; Error correction; Hidden Markov models; Natural languages; Speech analysis; Speech processing; Speech recognition; Vocabulary;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location :

Detroit, MI

ISSN :

1520-6149

Print_ISBN :

0-7803-2431-5

Type :

conf

DOI :

10.1109/ICASSP.1995.479672

Filename :

479672

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2930097