DocumentCode :
2310997
Title :
Prosody Model for Marathi Language TTS Synthesis with Unit Search and Selection Speech Database
Author :
Repe, Madhavi R. ; Shirbahadurkar, S.D. ; Desai, Smita
Author_Institution :
Electron. Dept., Pad. Dr. D.Y. P.I.E.T., Pune, India
fYear :
2010
fDate :
12-13 March 2010
Firstpage :
362
Lastpage :
364
Abstract :
A Text-To-Speech (TTS) synthesizer is a computer-based system that should be able to read any text aloud, whether the text was entered directly into the computer by an operator or scanned and submitted to an Optical Character Recognition (OCR) system. Systems that simply concatenate isolated words or parts of sentences, known as Voice Response Systems, are applicable only when a limited vocabulary is required (typically a few hundred words) and when the sentences to be pronounced follow a very restricted structure, as with arrival announcements in train stations. In the context of TTS synthesis, it is impossible (and, luckily, unnecessary) to record and store all the words of a language. It is thus more suitable to define Text-To-Speech as the automatic production of speech through a grapheme-to-phoneme transcription of the sentences to be uttered. In this paper, we implement a prosody model for Marathi language TTS synthesis with a unit search and selection speech database. TTS has so far been developed for many languages, such as English, Mandarin, and Telugu. Work on TTS for Marathi has also been done, but without a natural prosody effect. The synthesis technique used is concatenation: words or combinations of characters are concatenated, and the corresponding file is played from the database.
Keywords :
database management systems; optical character recognition; speech synthesis; Marathi language TTS synthesis; OCR; grapheme-to-phoneme transcription; optical character recognition; prosody model; selection speech database; speech automatic production; speech database; text-to-speech; unit search; voice response systems; Character recognition; Databases; Dictionaries; Humans; Natural languages; Optical character recognition software; Optical computing; Speech synthesis; Synthesizers; Telecommunication computing; Normalization; Phoneme; Prosody; Punctuations; consonants and vowels;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Recent Trends in Information, Telecommunication and Computing (ITC), 2010 International Conference on
Conference_Location :
Kochi, Kerala
Print_ISBN :
978-1-4244-5956-8
Type :
conf
DOI :
10.1109/ITC.2010.70
Filename :
5460582