• DocumentCode
    2660123
  • Title

    A research bed for unit selection based text to speech synthesis

  • Author

    Sarathy, K. Partha ; Ramakrishnan, A.G.

  • Author_Institution
    Centre for Dev. of Telematics, Bangalore
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    229
  • Lastpage
    232
  • Abstract
    The paper describes a modular, unit selection based TTS framework, which can be used as a research bed for developing TTS in any new language, as well as studying the effect of changing any parameter during synthesis. Using this framework, TTS has been developed for Tamil. Synthesis database consists of 1027 phonetically rich pre-recorded sentences. This framework has already been tested for Kannada. Our TTS synthesizes intelligible and acceptably natural speech, as supported by high mean opinion scores. The framework is further optimized to suit embedded applications like mobiles and PDAs. We compressed the synthesis speech database with standard speech compression algorithms used in commercial GSM phones and evaluated the quality of the resultant synthesized sentences. Even with a highly compressed database, the synthesized output is perceptually close to that with uncompressed database. Through experiments, we explored the ambiguities in human perception when listening to Tamil phones and syllables uttered in isolation, thus proposing to exploit the misperception to substitute for missing phone contexts in the database. Listening experiments have been conducted on sentences synthesized by deliberately replacing phones with their confused ones.
  • Keywords
    audio databases; data compression; natural language processing; speech coding; speech intelligibility; speech synthesis; Kannada; Tamil phones; human perception; listening; natural speech; research bed; speech compression algorithms; synthesis database; synthesis speech database; synthesized sentences; text to speech synthesis; uncompressed database; Databases; Digital signal processing; Engines; Natural languages; Signal synthesis; Speech analysis; Speech coding; Speech synthesis; Synthesizers; Testing; intelligibility; naturalness; perception; speech codecs; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
  • Conference_Location
    Goa
  • Print_ISBN
    978-1-4244-3471-8
  • Electronic_ISBN
    978-1-4244-3472-5
  • Type

    conf

  • DOI
    10.1109/SLT.2008.4777882
  • Filename
    4777882