DocumentCode :
2253263
Title :
A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition
Author :
Valverde-Albacete, Francisco J. ; Pardo, José M.
Author_Institution :
Area de Tecnologia Electron., Univ. Carlos III, Madrid, Spain
Volume :
1
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
224
Abstract :
We present a continuous speech recognition architecture with a tightly coupled language model that tries to improve the dwindling performance of the normal stack decoder with increasing lexicon size. We solve the problem of recognition by means of two mutually recursive functions. The first one uses an auxiliary retrieval function to obtain lexicalized (already built) solutions to the problem, and merges these solutions with the ones built by the second function. This second one describes the acoustical and semantic recognition process as a search problem defined with the help of the first function, and solved with the help of the A* strategy. As a linguistic model, we use a hierarchy of linguistic levels each of which has a particular meaning structure, a lexicon of lexicalized forms, their lexicalization probabilities, and a local lexical grammar describing how the semantic categories of the level can be built. The process can further be optimized if targets, constraints on the possible solutions, are given to the recognition process to guide and restrict it. Target guidance implies a mechanism for target focusing, locally matching targets to the recognition state, and target prediction with the help of a lexical local grammar. We are testing the architecture in a DARPA RM-like application
Keywords :
computational linguistics; grammars; natural language interfaces; recursive functions; search problems; software performance evaluation; speech recognition; A* strategy; DARPA RM; acoustical recognition; guided integrated continuous speech recognition; language model design; lexicon; lexicon size; linguistic model; local lexical grammar; multi-level lexical-semantics; mutually recursive functions; normal stack decoder; optimization; performance; retrieval function; search problem; semantic recognition; Buildings; Constraint optimization; Cost function; Decoding; Horses; Natural languages; Search problems; Speech recognition; Target recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607082
Filename :
607082
Link To Document :
بازگشت