• DocumentCode
    3124072
  • Title

    Alternative hypothesis generation using a weighted kernel feature matrix for ASR substitution error correction

  • Author

    Chao-Hong Liu ; Chung-Hsien Wu ; Sarwono, D.

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Although automatic speech recognition (ASR) has been successfully used in several applications, it is still non-robust and imprecise especially in a harsh environment wherein the input speech is of low quality. Robust error correction for ASR outputs thus becomes important in addition to improving recognition performance. In recent approaches to error correction, linguistic or domain information is used to generate the alternative hypotheses for the ASR outputs followed by the selection of the most likely alternative. In this study, the distances between ASR outputs and the potentially correct alternatives are estimated based on a weighted context-dependent syllable cluster-based kernel feature matrix followed by multidimensional scaling (MDS)-based distance rescaling. These distances are then used to construct an alternative syllable lattice and the dynamic programming is used to obtain the most likely correct output with respect to the original ASR results. Experiments show that the proposed method achieved about 1.95% improvement on the word error rate compared to the correction pair approach using the MATBN Mandarin Chinese broadcast news corpus.
  • Keywords
    dynamic programming; error correction; linguistics; matrix algebra; speech recognition; ASR substitution error correction; MATBN Mandarin Chinese broadcast news corpus; alternative hypothesis generation; automatic speech recognition; distance rescaling; domain information; dynamic programming; linguistic; multidimensional scaling; robust error correction; syllable lattice; weighted context-dependent syllable cluster; weighted kernel feature matrix; Decision trees; Error analysis; Error correction; Kernel; Lattices; Speech; Speech recognition; ASR substitution error; MDS-based distance rescaling; context-dependent syllable cluster; error correction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423475
  • Filename
    6423475