• DocumentCode
    2229397
  • Title

    A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognition

  • Author

    Aubert, Xavier ; Beyerlein, Peter ; Ullrich, Meinhard

  • Author_Institution
    Philips GmbH Forschungslab., Aachen, Germany
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    14
  • Abstract
    Presents an extension of bottom-up state-tying towards improved handling of unseen triphones. As opposed to the usual backing-off to diphones and monophones, the current method aims at finding a triphone model that has proven to exhibit some similarity with the unseen triphone. It is based on a probabilistic mapping of unseen contexts to clusters of triphone states observed in the training data. This algorithm has been applied to dictation tasks for three languages with vocabulary sizes ranging from 20k to 64k. The results compare favorably with those obtained using standard back-off rules. This technique also offers an alternative to top-down decision-tree procedures which are frequently used, especially for their generalization capabilities
  • Keywords
    dictation; hidden Markov models; probability; speech recognition; vocabulary; back-off rules; backing-off; bottom-up state-tying; dictation tasks; generalization capabilities; languages; large-vocabulary continuous speech recognition; probabilistic mapping; similarity; top-down decision-tree procedures; training data; triphone state clusters; unseen triphones; vocabulary size; Buildings; Clustering algorithms; Context modeling; Databases; Decision trees; Decoding; Laboratories; Speech recognition; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.606918
  • Filename
    606918