• DocumentCode
    976327
  • Title
    Structural methods in automatic speech recognition
  • Author
    Levinson, Stephen E.
  • Author_Institution
    AT&T Bell Laboratories, Murray Hill, NJ, USA
  • Volume
    73
  • Issue
    11
  • fYear
    1985
  • Firstpage
    1625
  • Lastpage
    1650
  • Abstract
    The past decade has witnessed substantial progress toward the goal of constructing a machine capable of understanding colloquial discourse. Central to this progress has been the development and application of mathematical methods that permit modeling the speech signal as a complex code with several coexisting levels of structure. The most successful of these are "template matching," stochastic modeling, and probabilistic parsing. The manifestation of common themes such as dynamic programming and finite-state descriptions accentuates a superficial likeness amongst the methods which is often mistaken for the deeper similarity arising from their shared Bayesian foundation. In this paper, we outline the mathematical bases of these methods: invariant metrics, hidden Markov chains, and formal grammars, respectively. We then recount and briefly interpret the results of experiments in speech recognition to which the various methods were applied. Since these mathematical principles seem to bear little resemblance to traditional linguistic characterizations of speech, the success of the experiments is occasionally attributed, even by their authors, merely to excellent engineering. We conclude by speculating that, quite to the contrary, these methods actually constitute a powerful theory of speech that can be reconciled with and elucidate conventional linguistic theories while being used to build truly competent mechanical speech recognizers.
  • Keywords
    Automatic speech recognition; Bayesian methods; Dynamic programming; Hidden Markov models; Mathematical model; Natural languages; Power engineering and energy; Speech coding; Speech recognition; Stochastic processes
  • fLanguage
    English
  • Journal_Title
    Proceedings of the IEEE
  • Publisher
    IEEE
  • ISSN
    0018-9219
  • Type
    jour
  • DOI
    10.1109/PROC.1985.13344
  • Filename
    1457612