• DocumentCode
    2253263
  • Title

    A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition

  • Author

    Valverde-Albacete, Francisco J. ; Pardo, José M.

  • Author_Institution
    Area de Tecnologia Electron., Univ. Carlos III, Madrid, Spain
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    224
  • Abstract
    We present a continuous speech recognition architecture with a tightly coupled language model that tries to improve the dwindling performance of the normal stack decoder with increasing lexicon size. We solve the problem of recognition by means of two mutually recursive functions. The first one uses an auxiliary retrieval function to obtain lexicalized (already built) solutions to the problem, and merges these solutions with the ones built by the second function. This second one describes the acoustical and semantic recognition process as a search problem defined with the help of the first function, and solved with the help of the A* strategy. As a linguistic model, we use a hierarchy of linguistic levels each of which has a particular meaning structure, a lexicon of lexicalized forms, their lexicalization probabilities, and a local lexical grammar describing how the semantic categories of the level can be built. The process can further be optimized if targets, constraints on the possible solutions, are given to the recognition process to guide and restrict it. Target guidance implies a mechanism for target focusing, locally matching targets to the recognition state, and target prediction with the help of a lexical local grammar. We are testing the architecture in a DARPA RM-like application
  • Keywords
    computational linguistics; grammars; natural language interfaces; recursive functions; search problems; software performance evaluation; speech recognition; A* strategy; DARPA RM; acoustical recognition; guided integrated continuous speech recognition; language model design; lexicon; lexicon size; linguistic model; local lexical grammar; multi-level lexical-semantics; mutually recursive functions; normal stack decoder; optimization; performance; retrieval function; search problem; semantic recognition; Buildings; Constraint optimization; Cost function; Decoding; Horses; Natural languages; Search problems; Speech recognition; Target recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607082
  • Filename
    607082