• DocumentCode
    3244408
  • Title

    Automatic pronunciation modelling for multiple non-native accents

  • Author

    Goronzy, Silke ; Eisele, Kathrin

  • Author_Institution
    Sony Corporate Labs Eur. - Adv. Software Lab, Sony Int. (Eur.) GmbH, Stuttgart, Germany
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    123
  • Lastpage
    128
  • Abstract
    This paper describes an automatic method for generating non-native pronunciations and its combination with speaker adaptation to solve the problem of a performance decrease if state-of-the-art speech recognisers are faced with non-native speech. Although being a data-driven approach it overcomes the problem of gathering accented speech data for deriving the non-native variants. It rather uses solely native speech of both languages $the language the speaker is speaking as well as his/her mother tongue. Our experiments showed that using the generated accented variants to enhance the standard pronunciation dictionary in combination with weighted MLLR speaker adaptation outperforms the baseline system by up to 20% and speaker adaptation alone by up to 3%. The approach was tested on English-accented German and on German-accented French and results were consistent throughout both languages. Since our approach relies on native speech data only, it can easily be extended to various accents for different languages.
  • Keywords
    speech processing; speech recognition; English-accented German; German-accented French; automatic pronunciation modelling; data-driven approach; multiple nonnative accents; nonnative speech; performance decrease; pronunciation dictionary; speaker adaptation; state-of-the-art speech recognisers; weighted MLLR speaker adaptation; Automatic speech recognition; Databases; Dictionaries; Europe; Face recognition; Hidden Markov models; Loudspeakers; Natural languages; Software performance; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318415
  • Filename
    1318415