• DocumentCode
    707576
  • Title

    Development of personalised corpus using open source software

  • Author

    Gahlawat, Mukta ; Bansal, Poonam ; Malik, Amita

  • Author_Institution
    Maharaja Surajmal Inst. of Technol., Delhi, India
  • fYear
    2015
  • fDate
    11-13 March 2015
  • Firstpage
    1853
  • Lastpage
    1858
  • Abstract
    Text to speech synthesis means conversion of written text into spoken words. In order to make speech more natural and intelligible one way is to concatenate the prerecorded units of speech using unit selection algorithm. These units are stored in one place from where they are played back, called speech corpus. There are various standard corpuses available online for testing purposes. But most of the times we need domain specific databases for training and testing our speech synthesizer. For such situations we have discussed here the methodology for developing corpus using open source software. By implementing these series of steps any researcher can create their own database. This will work for emotional as well as normal speech and one can create database of any length. The database developed using this methodology has been tested using concatenative speech synthesis and satisfactory results were obtained.
  • Keywords
    public domain software; speech synthesis; concatenative speech synthesis; open source software; personalised speech corpus; text to speech synthesis; unit selection algorithm; Databases; Open source software; Speech; Speech recognition; Speech synthesis; Synthesizers; Testing; Concatenation Synthesis; Corpus Creation; Emotional Database; Text to Speech System; Unit selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing for Sustainable Global Development (INDIACom), 2015 2nd International Conference on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-9-3805-4415-1
  • Type

    conf

  • Filename
    7100566