• DocumentCode
    2659599
  • Title

    Discriminative learning using linguistic features to rescore n-best speech hypotheses

  • Author

    Georgescul, Maria ; Rayner, Manny ; Bouillon, Pierrette ; Tsourakis, Nikos

  • Author_Institution
    ISSCO/TIM, ETI, Univ. of Geneva, Geneva
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    97
  • Lastpage
    100
  • Abstract
    We describe how we were able to improve the accuracy of a medium-vocabulary spoken dialog system by rescoring the list of n-best recognition hypotheses using a combination of acoustic, syntactic, semantic and discourse information. The non-acoustic features are extracted from different intermediate processing results produced by the natural language processing module, and automatically filtered. We apply discriminative support vector learning designed for re-ranking, using both word error rate and semantic error rate as ranking target value, and evaluating using five-fold cross-validation; to show robustness of our method, confidence intervals for word and semantic error rates are computed via bootstrap sampling. The reduction in semantic error rate, from 19% to 11%, is statistically significant at 0.01 level.
  • Keywords
    learning (artificial intelligence); natural language processing; speech recognition; support vector machines; discriminative support vector learning; linguistic features; medium-vocabulary spoken dialog system; n-best speech recognition hypotheses; natural language processing; Automatic speech recognition; Calendars; Data mining; Databases; Error analysis; Feature extraction; Natural languages; Power system modeling; Speech recognition; Vocabulary; natural language; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
  • Conference_Location
    Goa
  • Print_ISBN
    978-1-4244-3471-8
  • Electronic_ISBN
    978-1-4244-3472-5
  • Type

    conf

  • DOI
    10.1109/SLT.2008.4777849
  • Filename
    4777849