• DocumentCode
    3165829
  • Title

    Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM

  • Author

    Rasipuram, Ramya ; Doss, Mathew Magimai

  • Author_Institution
    Idiap Res. Inst., Martigny, Switzerland
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4841
  • Lastpage
    4844
  • Abstract
    This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach where first the probabilistic relation between graphemes and phonemes is captured from acoustic data using Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main application of the proposed G2P approach is in the area of low linguistic resource based automatic speech recognition or text-to-speech systems. We demonstrate this potential through a simulation study where linguistic resources from one domain is used to create linguistic resources for a different domain.
  • Keywords
    decoding; hidden Markov models; speech recognition; speech synthesis; KL-HMM; Kullback Leibler divergence; acoustic data driven; automatic speech recognition; decoding framework; grapheme-to-phoneme conversion; hidden Markov model; linguistic resources; probabilistic relation; sequence information; text-to-speech systems; Acoustics; Biological system modeling; Context; Context modeling; Dictionaries; Hidden Markov models; Pragmatics; Kullback-Leibler divergence based HMM; Lexicon; grapheme; grapheme-to-phoneme converter; multilayer perceptron; phoneme;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6289003
  • Filename
    6289003