مرکز منطقه ای اطلاع رساني علوم و فناوري - An overview of the SPHINX speech recognition system

DocumentCode :

787251

Title :

An overview of the SPHINX speech recognition system

Author :

Lee, Kai-Fu ; Hon, Hsiao-Wuen ; Reddy, Raj

Author_Institution :

Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

Issue :

fYear :

1990

fDate :

1/1/1990 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

A description is given of SPHINX, a system that demonstrates the feasibility of accurate, large-vocabulary, speaker-independent, continuous speech recognition. SPHINX is based on discrete hidden Markov models (HMMs) with LPC- (linear-predictive-coding) derived parameters. To provide speaker independence, knowledge was added to these HMMs in several ways: multiple codebooks of fixed-width parameters, and an enhanced recognizer with carefully designed models and word-duration modeling. To deal with coarticulation in continuous speech, yet still adequately represent a large vocabulary, two new subword speech units are introduced: function-word-dependent phone models and generalized triphone models. With grammars of perplexity 997, 60, and 20, SPHINX attained word accuracies of 71, 94, and 96%, respectively, on a 997-word task

Keywords :

Markov processes; speech recognition; HMM; LPC; SPHINX speech recognition system; coarticulation; continuous speech recognition; fixed-width parameters; function-word-dependent phone models; generalized triphone models; grammars; hidden Markov models; large-vocabulary; linear-predictive-coding; multiple codebooks; speaker-independent; subword speech units; word accuracies; word-duration modeling; Automatic speech recognition; Continuous wavelet transforms; Degradation; Error analysis; Hidden Markov models; Marine vehicles; Mars; Speech recognition; System testing; Vocabulary;

fLanguage :

English

Journal_Title :

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

0096-3518

Type :

jour

DOI :

10.1109/29.45616

Filename :

45616

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=787251