DocumentCode :
3723547
Title :
Development of Assamese Text-to-speech synthesis system
Author :
Bidisha Sharma;Nagaraj Adiga;S. R. Mahadeva Prasanna
Author_Institution :
Dept. of Electronics and Electrical Engineering, Indian Institute of Technology Guwahati, India
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
This paper presents the design and development of Assamese Text to speech (TTS) synthesis system. In particular, work focused on designing language specific rules, developing quality database, data segmentation, and to handle bilingual sound units. In Assamese language, till now no study is done to construct the grapheme to phoneme conversion rules. In this work, grapheme to phoneme conversion rules are proposed for Assamese language. The database is recorded by checking the speaking rate, variation in amplitude level, dc wandering, and clipping during data collection. A significant improvement in the synthesized voice is observed by ensuring uniform speaking rate, controlling variation in the signal amplitude level, and avoiding dc wandering and clipping during data collection. A semi-automatic segmentation approach is developed for data segmentation. Initially, segmentation is done by automatic process and later manual correction of segmentation boundaries is done to improve quality and intelligibility. It also reduce time required for the segmentation process. The developed TTS can work in bilingual mode. It can switch between Assamese and English language smoothly and maintains the sentence level intonation even for mixed texts.
Keywords :
"Speech","Databases","Hidden Markov models","High-temperature superconductors","Data collection","Buildings","Switches"
Publisher :
ieee
Conference_Titel :
TENCON 2015 - 2015 IEEE Region 10 Conference
ISSN :
2159-3442
Print_ISBN :
978-1-4799-8639-2
Electronic_ISBN :
2159-3450
Type :
conf
DOI :
10.1109/TENCON.2015.7372786
Filename :
7372786
Link To Document :
بازگشت