• DocumentCode
    310649
  • Title

    Vocabulary optimization based on perplexity

  • Author

    Hwang, Kyuwoong

  • Author_Institution
    Spoken Language Section, Electron. & Telecommun. Res. Inst., Daejeon, South Korea
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1419
  • Abstract
    We suggest a method to optimize the vocabulary for a given task using the perplexity criterion. The optimization allows us to reduce the size of the vocabulary at the same perplexity of the original word based vocabulary or to reduce perplexity at the same vocabulary size. This new approach is an alternative to phoneme n-gram language model in the speech recognition search stage. We show the convergence of our approach on the Korean training corpus. This method may provide an optimized speech recognizer for a given task. We used phonemes, syllables, morphemes as the basic units for the optimization and reduced the size of the vocabulary to the half of the original word vocabulary size for the morpheme case
  • Keywords
    optimisation; speech processing; speech recognition; Korean training corpus; convergence; morphemes; optimized speech recognizer; perplexity; phonemes; speech recognition search stage; syllables; vocabulary optimization; vocabulary size reduction; Automatic speech recognition; Convergence; Frequency; Interactive systems; Laboratories; Natural languages; Optimization methods; Speech recognition; Statistical analysis; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596214
  • Filename
    596214