• DocumentCode
    2859516
  • Title

    Continuous speech recognition from a phonetic transcription

  • Author

    Levinson, S.E. ; Ljolje, A. ; Miller, L.G.

  • Author_Institution
    AT&T Bell Lab., Murray Hill, NJ, USA
  • fYear
    1990
  • fDate
    3-6 Apr 1990
  • Firstpage
    93
  • Abstract
    A widely accepted linguistic theory holds that human speech recognition proceeds from an intermediate representation of the acoustic signal in terms of a small number of phonetic symbols. A novel speech recognition system based on this theory is described. Its acoustic-to-phonetic mapping is accomplished by a particular form of hidden Markov model and is independent of lexical and syntactic constraints. Word recognition is then treated as a classical string-to-string editing problem, which is solved with a two-level dynamic programming algorithm that accounts for lexical and syntactic structure. The system was tested on speaker-independent recognition of fluent speech from the 991-word DARPA resource management task, on which it achieved 76.6% word accuracy. In informal tests, it was observed that the phonetic transcription can be resynthesized to provide a 100-bit/s vocoder with word intelligibility rates of approximately 75%.
  • Keywords
    speech analysis and processing; speech recognition; DARPA resource management task; acoustic signal; acoustic-to-phonetic mapping; classical string-to-string editing problem; fluent speech; hidden Markov model; informal tests; intermediate representation; lexical structure; linguistic theory; phonetic transcription; small number of phonetic symbols; speaker-independent recognition; speech recognition system; syntactic structure; two-level dynamic programming algorithm; vocoder; word intelligibility rates; Cepstral analysis; Constraint theory; Dynamic programming; Heuristic algorithms; Hidden Markov models; Humans; Natural languages; Resource management; Signal processing algorithms; Speech processing; Speech recognition; System testing; Vocoders;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
  • Conference_Location
    Albuquerque, NM
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1990.115545
  • Filename
    115545
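
  • Illustrative_Sketch
    The abstract treats word recognition as a classical string-to-string editing problem solved by dynamic programming. The sketch below shows only that basic idea and is not the authors' two-level algorithm: it computes a unit-cost edit distance between an observed phone string and each pronunciation in a lexicon, then picks the closest word. The function names, the toy lexicon, its phone symbols, and the unit costs are all assumptions made for illustration; none of them come from the paper.

```python
def edit_distance(observed, reference):
    """Unit-cost edit distance between two phone sequences (lists of symbols)."""
    # prev[j] holds the distance between the observed prefix seen so far
    # and the first j symbols of the reference.
    prev = list(range(len(reference) + 1))
    for i, obs in enumerate(observed, start=1):
        curr = [i]  # aligning i observed phones against an empty reference
        for j, ref in enumerate(reference, start=1):
            cost = 0 if obs == ref else 1
            curr.append(min(
                prev[j] + 1,         # delete the observed phone
                curr[j - 1] + 1,     # insert the reference phone
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def closest_word(observed, lexicon):
    """Return the lexicon word whose pronunciation is nearest to `observed`."""
    return min(lexicon, key=lambda word: edit_distance(observed, lexicon[word]))


# Hypothetical toy lexicon; these pronunciations are illustrative only.
LEXICON = {
    "ship": ["sh", "ih", "p"],
    "chart": ["ch", "aa", "r", "t"],
    "show": ["sh", "ow"],
}

if __name__ == "__main__":
    noisy = ["sh", "ih", "b"]  # assumed transcription with one wrong phone
    print(closest_word(noisy, LEXICON))
```

    A system of the kind the abstract describes would presumably replace the unit costs with costs reflecting likely phonetic confusions and would add a second level that strings words together under syntactic constraints; neither refinement is attempted here.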