DocumentCode
2859516
Title
Continuous speech recognition from a phonetic transcription
Author
Levinson, S.E. ; Ljolje, A. ; Miller, L.G.
Author_Institution
AT&T Bell Lab., Murray Hill, NJ, USA
fYear
1990
fDate
3-6 Apr 1990
Firstpage
93
Abstract
A widely accepted linguistic theory holds that speech recognition in humans proceeds from an intermediate representation of the acoustic signal in terms of a small number of phonetic symbols. A novel speech recognition system based on this theory is described, in which the acoustic-to-phonetic mapping is accomplished by a particular form of hidden Markov model and is independent of lexical and syntactic constraints. Word recognition is then treated as a classical string-to-string editing problem, which is solved with a two-level dynamic programming algorithm that accounts for lexical and syntactic structure. The system was tested on speaker-independent recognition of fluent speech from the 991-word DARPA resource management task, on which it achieved 76.6% word accuracy. Informal tests showed that the phonetic transcription can be resynthesized to provide a 100-bit/s vocoder with word intelligibility rates of approximately 75%.
Keywords
speech analysis and processing; speech recognition; DARPA resource management task; acoustic signal; acoustic-to-phonetic mapping; classical string-to-string editing problem; fluent speech; hidden Markov model; informal tests; intermediate representation; lexical structure; linguistic theory; phonetic transcription; small number of phonetic symbols; speaker-independent recognition; speech recognition system; syntactic structure; two-level dynamic programming algorithm; vocoder; word intelligibility rates; Cepstral analysis; Constraint theory; Dynamic programming; Heuristic algorithms; Hidden Markov models; Humans; Natural languages; Resource management; Signal processing algorithms; Speech processing; Speech recognition; System testing; Vocoders;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location
Albuquerque, NM
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1990.115545
Filename
115545