• DocumentCode
    454673
  • Title

    Sub-Phonetic Modeling For Capturing Pronunciation Variations For Conversational Speech Synthesis

  • Author

    Prahallad, Kishore ; Black, Alan W. ; Mosur, Ravishankhar

  • Author_Institution
    Inst. of Language Technol., Carnegie Mellon Univ.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    In this paper we address the issue of pronunciation modeling for conversational speech synthesis. We experiment with two different HMM topologies (fully connected state model and forward connected state model) for sub-phonetic modeling to capture the deletion and insertion of sub-phonetic states during speech production process. We show that the experimented HMM topologies have higher log likelihood than the traditional 5-state sequential model. We also study the first and second mentions of content words and their influence on the pronunciation variation. Finally we report phone recognition experiments using the modified HMM topologies
  • Keywords
    hidden Markov models; speech processing; speech recognition; speech synthesis; HMM topologies; conversational speech synthesis; forward connected state model; phone recognition; pronunciation variations; speech production process; state sequential model; sub-phonetic modeling; Context modeling; Databases; Hidden Markov models; Natural languages; Neural networks; Predictive models; Speech processing; Speech recognition; Speech synthesis; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660155
  • Filename
    1660155