• DocumentCode
    1998056
  • Title
    Improving Korean LVCSR with Long-Time Temporal Patterns and an Extended Phoneme Set
  • Author
    Ji Xu ; Zhen Zhang ; Qingqing Zhang ; Jielin Pan ; Yonghong Yan
  • Author_Institution
    Key Lab. of Speech Acoust. & Content Understanding, Beijing, China
  • fYear
    2013
  • fDate
    3-4 Dec. 2013
  • Firstpage
    336
  • Lastpage
    340
  • Abstract
    Korean is an agglutinative language in which pronunciation is affected by long-term context. This paper investigates long-time temporal information as a means of improving Korean LVCSR. TRAP-based MLP features, which can exploit acoustic information scattered over several hundred milliseconds, are used to supplement the conventional cepstral features. In contrast to the traditional Korean phoneme set, in which consonants in initial and final positions are treated as identical, a more specific phoneme set is constructed by treating consonants as position dependent. Experiments on a Korean broadcast news speech recognition task show that these improvements reduce the character error rate by 25.3% relative to the baseline system.
  • Keywords
    acoustic signal processing; natural language processing; speech recognition; Korean LVCSR; Korean broadcast news speech recognition task; Korean phoneme set; TRAP-based MLP features; agglutinative language; baseline system; character error rate; consonants; extended phoneme set; long-term context; long-time temporal information; long-time temporal patterns; scattered acoustic information; Acoustics; Context; Dictionaries; Feature extraction; Hidden Markov models; Speech; Speech recognition; phoneme set
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Systems (GCIS), 2013 Fourth Global Congress on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4799-2885-9
  • Type
    conf
  • DOI
    10.1109/GCIS.2013.60
  • Filename
    6805957