• DocumentCode
    706296
  • Title

    Automatic language recognition with tonal and non-tonal language pre-classification

  • Author

    Liang Wang ; Ambikairajah, Eliathamby ; Choi, Eric H. C.

  • Author_Institution
    Sch. of EE&Telecomm., Univ. of New South Wales, Sydney, NSW, Australia
  • fYear
    2007
  • fDate
    3-7 Sept. 2007
  • Firstpage
    2375
  • Lastpage
    2379
  • Abstract
    Parallel Phoneme Recognition followed by Language Modelling (PPRLM) systems currently provide state of the art language identification performance on conversational telephone speech. In this paper an innovative method for tonal and non-tonal language pre-classification by using prosodie information is reported. Our motivation is to improve recognition accuracy and save the amount of CPU run-time while handling large number of languages. Also, by incorporating different confidence measures into the traditional PPRLM framework, we propose an optimized language recognition system that can be applied in an open-set language recognition task. For a task of 12 target languages and 4 non-target languages, our results show that with the optimized pre-classification, Universal Background Phone Model confidence measuring and Witten-Bell discounting the system can achieve recognition accuracy rates of 77.9% for 30-sec speech segments and 49.2% for 10-sec speech segments.
  • Keywords
    natural language processing; speech recognition; CPU run-time; PPRLM systems; Witten-Bell discounting system; automatic language recognition; conversational telephone speech; language identification performance; nontarget languages; nontonal language preclassification; open-set language recognition task; optimized language recognition system; optimized preclassification; parallel phoneme recognition followed-by-language modelling; prosodic information; recognition accuracy improvement; recognition accuracy rates; speech segments; target languages; tonal language preclassification; universal background phone model confidence measure; Accuracy; Acoustics; Europe; Feature extraction; Hidden Markov models; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2007 15th European
  • Conference_Location
    Poznan
  • Print_ISBN
    978-839-2134-04-6
  • Type

    conf

  • Filename
    7099233