مرکز منطقه ای اطلاع رساني علوم و فناوري - Improving analysis techniques for automatic speech recognition

DocumentCode :

3294393

Title :

Improving analysis techniques for automatic speech recognition

Author :

O´Shaughnessy, Douglas

Author_Institution :

INRS-Telecommun., Montreal, Que., Canada

Volume :

fYear :

2002

fDate :

4-7 Aug. 2002

Abstract :

Automatic recognition of continuous speech for small vocabularies (e.g., telephone or credit card digit sequences) is possible with excellent accuracy, even in applications using telephone lines and serving a large population of users. However, even such simple recognition tasks suffer decreased performance in adverse conditions, e.g., significant background noise or fading on portable telephone channels. If we further impose significant limitations on the computing resources for the recognition task, then robust efficient speech recognition is still a significant challenge, even for simple vocabularies. Since many practical recognition tasks take place over telephone lines and in conditions that are less than optimal, speech recognition must be robust. The traditional hidden Markov model approach to speech recognition, using cepstral analysis, is computationally intensive and often does not work well under adverse acoustic conditions. We examine a simpler spectral analysis method, and suggest a segmental approach. High recognition accuracy can be maintained, despite a large decrease in both memory and computation, compared to traditional approaches.

Keywords :

acoustic noise; spectral analysis; speech processing; speech recognition; vocabulary; automatic speech recognition; background noise; continuous speech; credit card digit sequences; fading; portable telephone channels; recognition accuracy; recognition task computing resources; robust speech recognition; segmental spectral analysis; small vocabularies; speech analysis techniques; telephone sequences; Automatic speech recognition; Background noise; Credit cards; Fading; Hidden Markov models; Noise robustness; Speech analysis; Speech recognition; Telephony; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on

Print_ISBN :

0-7803-7523-8

Type :

conf

DOI :

10.1109/MWSCAS.2002.1186971

Filename :

1186971

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3294393