DocumentCode :
3399022
Title :
Design issues in developing speech corpus for Indian languages — A survey
Author :
Kiruthiga, S. ; Krishnamoorthy, K.
Author_Institution :
Anna Univ. of Technol. Coimbatore, Coimbatore, India
fYear :
2012
fDate :
10-12 Jan. 2012
Firstpage :
1
Lastpage :
4
Abstract :
Any spoken language system, it may either be a speech synthesis or a speech recognition system, starts with building a speech corpora. We give a detailed survey of issues in building a speech corpus for Indian languages. To begin with, an appropriate text file should be selected for building the speech corpus. Then a corresponding speech file is generated and stored. This speech file is the phonetic representation of the selected text file. The speech file is processed in different levels viz., paragraphs, sentences, phrases, words, syllables and phones. These are called the speech units of the file. Researches have been done taking these units as the basic unit for processing. This paper analyses the researches done using phones, diphones, triphones, syllables and polysyllables as their basic unit for speech synthesis. Concatenative speech synthesis involves the concatenation of these basic units to synthesize a natural sounding speech. The speech units are added with some more relevnt information about each unit, manually or automatically, based on an algorithm. The database consisting of the units along with their associated information is called as the speech corpus. Techniques that are used in the database to improve the intelligibility of the synthesized speech in Speech synthesis system are also surveyed.
Keywords :
natural language processing; speech recognition; speech synthesis; Indian languages; design issues; diphone; file speech units; paragraphs; phrases; polysyllable; sentences; speech corpus development; speech file; speech recognition system; speech synthesis; spoken language system; text file phonetic representation; triphone; words; Buildings; Computers; Databases; Speech; Speech recognition; Speech synthesis; Concatenative Speech synthesis; Indian languages; Speech corpus;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Communication and Informatics (ICCCI), 2012 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4577-1580-8
Type :
conf
DOI :
10.1109/ICCCI.2012.6158831
Filename :
6158831
Link To Document :
بازگشت