Continuous speech recognition from a phonetic transcription

Author

Levinson, S.E. ; Ljolje, A. ; Miller, L.G.

Author_Institution

AT&T Bell Lab., Murray Hill, NJ, USA

fYear

1990

fDate

3-6 Apr 1990

Firstpage

93

Abstract

A widely accepted linguistic theory holds that speech recognition in humans proceeds from an intermediate representation of the acoustic signal in terms of a small number of phonetic symbols. A novel speech recognition system based on this theory in which the acoustic-to-phonetic mapping is accomplished by means of a particular form of hidden Markov model and is independent of lexical and syntactic constraint is described. Word recognition is then treated as a classical string-to-string editing problem which is solved with a two-level dynamic programming algorithm that accounts for lexical and syntactic structure. The system was tested on speaker-independent recognition of fluent speech from the 991-word DARPA resource management task, on which 76.6% word accuracy was achieved. In informal tests it was observed that the phonetic transcription can be resynthesized to provide a 100-bit/s vocoder with word intelligibility rates of approximately 75%

Keywords

speech analysis and processing; speech recognition; DARPA resource management task; acoustic signal; acoustic-to-phonetic mapping; classical string-to-string editing problem; fluent speech; hidden Markov model; informal tests; intermediate representation; lexical structure; linguistic theory; phonetic transcription; small number of phonetic symbols; speaker-independent recognition; speech recognition system; syntactic structure; two-level dynamic programming algorithm; vocoder; word intelligibility rates; Cepstral analysis; Constraint theory; Dynamic programming; Heuristic algorithms; Hidden Markov models; Humans; Natural languages; Resource management; Signal processing algorithms; Speech processing; Speech recognition; System testing; Vocoders;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on

Conference_Location

Albuquerque, NM

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.1990.115545

Filename

115545