• DocumentCode
    3486971
  • Title

    Filler model based confidence measures for spoken dialog systems

  • Author

    Akyol, Aydin ; Erdogan, Hakan

  • Author_Institution
    Sabanci Universitesi, Istanbul, Turkey
  • fYear
    2004
  • fDate
    28-30 April 2004
  • Firstpage
    552
  • Lastpage
    555
  • Abstract
    Because of the inadequate performance of speech recognition systems, an accurate confidence scoring mechanism should be employed to understand user requests correctly. To determine a confidence score for a hypothesis, certain confidence features are combined. The performance of filler model based confidence features are investigated. Five types of filler model networks were defined: triphone-network, phone-network, phone-class network, 5-state catch-all model and 3-state catch-all model. First, all the models were evaluated in a Turkish speech recognition task in terms of their ability to tag correctly (recognition-error or correct) recognition hypotheses. The best performance was obtained from the triphone recognition network. Then, the performance of reliable combinations of these models was investigated and it was observed that certain combinations of filler models could significantly improve the accuracy of the confidence annotation.
  • Keywords
    interactive systems; natural language interfaces; speech recognition; 3-state catch-all model; 5-state catch-all model; Turkish speech recognition task; confidence score; filler model based confidence measures; phone-class network; phone-network; recognition hypotheses; spoken dialog systems; triphone-network; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference, 2004. Proceedings of the IEEE 12th
  • Print_ISBN
    0-7803-8318-4
  • Type

    conf

  • DOI
    10.1109/SIU.2004.1338588
  • Filename
    1338588