DocumentCode :
310649
Title :
Vocabulary optimization based on perplexity
Author :
Hwang, Kyuwoong
Author_Institution :
Spoken Language Section, Electron. & Telecommun. Res. Inst., Daejeon, South Korea
Volume :
2
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
1419
Abstract :
We suggest a method to optimize the vocabulary for a given task using the perplexity criterion. The optimization allows us to reduce the size of the vocabulary at the same perplexity of the original word based vocabulary or to reduce perplexity at the same vocabulary size. This new approach is an alternative to phoneme n-gram language model in the speech recognition search stage. We show the convergence of our approach on the Korean training corpus. This method may provide an optimized speech recognizer for a given task. We used phonemes, syllables, morphemes as the basic units for the optimization and reduced the size of the vocabulary to the half of the original word vocabulary size for the morpheme case
Keywords :
optimisation; speech processing; speech recognition; Korean training corpus; convergence; morphemes; optimized speech recognizer; perplexity; phonemes; speech recognition search stage; syllables; vocabulary optimization; vocabulary size reduction; Automatic speech recognition; Convergence; Frequency; Interactive systems; Laboratories; Natural languages; Optimization methods; Speech recognition; Statistical analysis; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.596214
Filename :
596214
Link To Document :
بازگشت