• DocumentCode
    114256
  • Title

    Multi-level adaptive network for accented Mandarin speech recognition

  • Author

    Huiyong Wang ; Lan Wang ; Xunying Liu

  • Author_Institution
    Shenzhen Inst. of Adv. Technol., Univ. of Chinese Acad. of Sci., Shenzhen, China
  • fYear
    2014
  • fDate
    26-28 April 2014
  • Firstpage
    602
  • Lastpage
    605
  • Abstract
    Accented speech recognition is more challenging than standard speech recognition due to acoustic and linguistic mismatch between standard and accented data. In this paper, we propose a new framework combining Tandem system to improve the discriminative ability of acoustic features with Multi-level Adaptive Network (MLAN) to incorporate information from standard Mandarin corpus and also to solve the data sparseness problem. Mandarin spoken by Guangzhou speakers is considered as the accented mandarin (accented Putonghua, A-PTH), while spoken by northern area as the standard mandarin (standard Putonghua, S-PTH). Significant character error rate reduction of 13.8% and 24.6% relative are obtained over the baseline GMM-HMM systems trained on mixed corpus including both A-PTH and S-PTH corpus, as well as only the A-PTH corpus respectively.
  • Keywords
    Gaussian processes; error statistics; hidden Markov models; natural language processing; speech recognition; A-PTH corpus; Guangzhou speakers; MLAN; Mandarin spoken; S-PTH corpus; Tandem system; accented Mandarin speech recognition; acoustic features; acoustic mismatch; baseline GMM-HMM systems; character error rate reduction; data sparseness problem; discriminative ability; linguistic mismatch; multilevel adaptive network; standard Mandarin corpus; standard Putonghua; Acoustics; Adaptation models; Hidden Markov models; Speech; Speech recognition; Standards; Training; ASR; Tandem system; accented; mandarin; neural network adaptation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Technology (ICIST), 2014 4th IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/ICIST.2014.6920550
  • Filename
    6920550