• DocumentCode
    1849171
  • Title

    ASR domain adaptation methods for low-resourced languages: Application to Romanian language

  • Author

    Cucu, Horia ; Besacier, Laurent ; Burileanu, Corneliu ; Buzo, Andi

  • Author_Institution
    Univ. Politeh. of Bucharest, Bucharest, Romania
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    1648
  • Lastpage
    1652
  • Abstract
    This study investigates the possibility of using statistical machine translation to create domain-specific language resources. We propose a methodology that aims to create a domain-specific automatic speech recognition system for a low-resourced language when in-domain text corpora are available only in a high-resourced language. We evaluate a new semi-supervised method and compare it with previously developed semi-supervised and unsupervised approaches. Moreover, in the effort of creating an out-of-domain language model for Romanian, we introduce and experiment an effective diacritics restoration algorithm.
  • Keywords
    language translation; learning (artificial intelligence); natural language processing; speech recognition; statistical analysis; ASR domain adaptation methods; Romanian language; domain specific language resource; low-resourced languages; semisupervised method; speech recognition system; statistical machine translation; text corpora; Acoustics; Adaptation models; Automatic speech recognition; Context; Hidden Markov models; Probabilistic logic; Speech; ASR domain adaptation; SMT; diacritics restoration; language modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
  • Conference_Location
    Bucharest
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type

    conf

  • Filename
    6333947