Title :
A syllabic-filler-based continuous speech recognizer for unlimited vocabulary
Author :
El Méliani, R. ; O´Shaughnessy, D.
Author_Institution :
INRS Telecommun., Ile des Soeurs, Que., Canada
Abstract :
Most continuous speech recognition systems make the assumption that the input speech to process is limited to correspond to a text from a given dictionary, while fluent speech is, in fact, characterized by the addition of out-of-dictionary words, hesitations and interrupted words. The authors describe an unlimited vocabulary continuous speech recognizer based on new-word detection and transcription. In the system the words from a pre-defined dictionary as well as the extraneous speech (unknown words) use simply the same context-dependent phoneme hidden Markov models (HMMs) trained on data including only known words. The distinction between dictionary words and unknown words is made during the search by the addition of lexical fillers in the lexical tree used by the two-pass Viterbi type algorithm of the scoring method as well as by modifying the language models. A performance comparison is given for phonemic as well as syllabic fillers
Keywords :
Viterbi detection; hidden Markov models; natural languages; speech recognition; context-dependent phoneme hidden Markov models; extraneous speech; fluent speech; hesitations; input speech; interrupted words; lexical fillers; lexical tree; new-word detection; out-of-dictionary words; performance; phonemic fillers; pre-defined dictionary; scoring method; syllabic-filler-based continuous speech recognizer; transcription; two-pass Viterbi type algorithm; unlimited vocabulary; Delay effects; Dictionaries; Hidden Markov models; Natural languages; Speech processing; Speech recognition; Text processing; Tree graphs; Viterbi algorithm; Vocabulary;
Conference_Titel :
Electrical and Computer Engineering, 1995. Canadian Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-7803-2766-7
DOI :
10.1109/CCECE.1995.526599