Development of personalised corpus using open source software

Author

Gahlawat, Mukta ; Bansal, Poonam ; Malik, Amita

Author_Institution

Maharaja Surajmal Inst. of Technol., Delhi, India

fYear

2015

fDate

11-13 March 2015

Firstpage

1853

Lastpage

1858

Abstract

Text-to-speech synthesis is the conversion of written text into spoken words. One way to make synthesized speech more natural and intelligible is to concatenate prerecorded units of speech using a unit selection algorithm. These units are stored in, and played back from, a database called a speech corpus. Various standard corpora are available online for testing purposes, but most of the time domain-specific databases are needed for training and testing a speech synthesizer. For such situations, we discuss a methodology for developing a corpus using open source software. By following this series of steps, any researcher can create their own database. The methodology works for emotional as well as normal speech and can produce a database of any length. A database developed using this methodology was tested with concatenative speech synthesis, and satisfactory results were obtained.

Keywords

public domain software; speech synthesis; concatenative speech synthesis; open source software; personalised speech corpus; text to speech synthesis; unit selection algorithm; Databases; Open source software; Speech; Speech recognition; Speech synthesis; Synthesizers; Testing; Concatenation Synthesis; Corpus Creation; Emotional Database; Text to Speech System; Unit selection

fLanguage

English

Publisher

IEEE

Conference_Titel

Computing for Sustainable Global Development (INDIACom), 2015 2nd International Conference on

Conference_Location

New Delhi

Print_ISBN

978-9-3805-4415-1

Type

conf

Filename

7100566