• DocumentCode
    3646037
  • Title

    Discriminative reranking of ASR hypotheses with morpholexical and N-best-list features

  • Author

    Haşim Sak;Murat Saraçlar;Tunga Güngör

  • Author_Institution
    Department of Computer Engineering, Boğ
  • fYear
    2011
  • Firstpage
    202
  • Lastpage
    207
  • Abstract
    This paper explores rich morphological and novel n-best-list features for reranking automatic speech recognition hypotheses. The morpholexical features are defined over the morphological features obtained by using an n-gram language model over lexical and grammatical morphemes in the first-pass. The n-best-list features for each hypothesis are defined using that hypothesis and other alternate hypotheses in an n-best list. Our methodology is to align each hypothesis with other hypotheses one by one using minimum edit distance alignment. This gives us a set of edit operations - substitution, addition and deletion as seen in these alignments. These edit operations constitute our n-best-list features as indicator features. The reranking model is trained using a word error rate sensitive averaged perceptron algorithm introduced in this paper. The proposed methods are evaluated on a Turkish broadcast news transcription task. The baseline systems are word and statistical sub-word systems which also employ morphological features for reranking. We show that morpholexical and n-best-list features are effective in improving the accuracy of the system (0.8%).
  • Keywords
    "Feature extraction","Hidden Markov models","Error analysis","Vectors","Training","Acoustics","Decoding"
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Print_ISBN
    978-1-4673-0365-1
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163931
  • Filename
    6163931