Title :
Pitch and duration modification for expressive speech synthesis in Marathi TTS system
Author :
Anil, Manjare Chandraprabha ; Shirbahadurkar, S.D.
Author_Institution :
Dept. of Electron. & Telecommun. Eng., JSPM´s Rajarshi Shahu Coll. of Eng., Pune, India
Abstract :
Generating expressive synthetic speech is very important in high quality Marathi Text-to-Speech (TTS) system. This paper focuses on voice conversion and modification technique maintaining acceptable quality and naturalness with reduced database. In this paper, a method to modify fundamental frequency contour is proposed for Marathi TTS system. The naturalness of the speech is highly correlated to phonetic description and prosodic features such as Fundamental frequency and duration of that phone. For prosody generation, we have obtained a primary pitch curve for the word, based on the location followed by punctuation marks. Question mark and exclamation mark in the text are studied to modify prosody. Phase-Vocoder technique can be used to improve the prosody of the synthesized speech. The experimental results showed that the proposed prosody modification, based on pitch and duration modification can improve speech quality.
Keywords :
speech synthesis; vocoders; Marathi TTS system; Marathi text-to-speech system; duration modification; expressive speech synthesis; fundamental frequency contour; phase-vocoder technique; phonetic description; pitch modification; primary pitch curve; prosodic features; punctuation marks; speech quality improvement; voice conversion; Algorithm design and analysis; Hidden Markov models; Mathematical model; Speech; Speech synthesis; Vocoders; Fundamental frequency; Phase-Vocoder; Pitch Contour; Prosody; Speech synthesis;
Conference_Titel :
Pervasive Computing (ICPC), 2015 International Conference on
Conference_Location :
Pune
DOI :
10.1109/PERVASIVE.2015.7086977