• DocumentCode
    3427371
  • Title

    Detecting tone errors in continuous Mandarin speech

  • Author

    Zhang, Yan-Bin ; Chu, Min ; Huang, Chao ; Liang, Man-Gui

  • Author_Institution
    Inf. Technol. Inst., Beijing Jiaotong Univ., Beijing
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    5065
  • Lastpage
    5068
  • Abstract
    This paper proposes a new approach for detecting tone errors in continuous Mandarin speech. In the training phase, tone variations are modeled with context-dependent MSD-HMMs, which consider six contextual factors instead of the two used in traditional triphone HMMs. In the evaluation phase, the goodness of tone pronunciation is measured by the Kullback-Leibler divergence (KLD) between the expected tone model and the most representative tone model. When the KLD between the two models exceeds a threshold, the tone is flagged as a pronunciation error. The equal error rate on the ROC curve is 2.6%.
  • Keywords
    hidden Markov models; natural language processing; speech processing; Kullback-Leibler divergence; MSD-HMM; ROC curve; continuous Mandarin speech; multi-space distribution-hidden Markov models; tone error detection; tone model; tone pronunciation; tone variations; Asia; Chaos; Computer errors; Context modeling; Error analysis; Hidden Markov models; Information technology; Natural languages; Phase measurement; Speech; Context Depended Tone Model (CDTM); Kullback-Leibler Divergence (KLD); Tone Error Detection;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518797
  • Filename
    4518797
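
The decision rule summarized in the abstract (flag a tone as an error when the KLD between the expected and the most representative tone model exceeds a threshold) can be illustrated with a minimal sketch. This is not the authors' implementation: a single univariate Gaussian stands in for each MSD-HMM tone model, and the names `gaussian_kld`, `is_tone_error`, and the threshold values are hypothetical choices made only for illustration.

```python
import math


def gaussian_kld(mu_p, var_p, mu_q, var_q):
    """Closed-form KL(p || q) for two univariate Gaussians.

    Assumption: each tone model is reduced to one Gaussian (mean, variance);
    the paper's models are context-dependent MSD-HMMs, which are richer.
    """
    return 0.5 * (
        math.log(var_q / var_p)
        + (var_p + (mu_p - mu_q) ** 2) / var_q
        - 1.0
    )


def is_tone_error(expected, representative, threshold):
    """Return True when the divergence between the two models exceeds the threshold.

    `expected` and `representative` are (mean, variance) tuples; `threshold`
    is a hypothetical tuning parameter, e.g. chosen from an ROC curve.
    """
    return gaussian_kld(*expected, *representative) > threshold


if __name__ == "__main__":
    expected_model = (0.0, 1.0)        # hypothetical expected tone model
    representative_model = (1.0, 1.0)  # hypothetical best-matching tone model
    print(is_tone_error(expected_model, representative_model, threshold=0.3))
```

Sweeping the threshold over a labeled set and plotting false acceptance against false rejection would give an ROC curve of the kind the abstract refers to; the equal error rate is the point where the two error rates coincide.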