DocumentCode :
137162
Title :
A probabilistic approach to selecting units for speech synthesis based on acoustic similarity
Author :
Babu, Ajay ; Krishnan, K. Raghava ; Sao, Anil Kumar ; Murthy, Hema A.
Author_Institution :
Sch. of Comput. & Electr. Eng., Indian Inst. of Technol. Mandi, Mandi, India
fYear :
2014
fDate :
Feb. 28 2014-March 2 2014
Firstpage :
1
Lastpage :
6
Abstract :
Most unit selection synthesisers sound quite natural when the database consists of a number of realisations of the same sound unit from a large number of contexts. A common problem observed with these synthesisers is unexpected prosody when a new context is presented in the text. The objective of this paper is to address this issue and select appropriate units that are relevant to a specific context. Text-to-speech synthesisers propose a number of different features based on the linguistic context to select units. The key contribution in this paper is that the acoustic context rather than the linguistic context is crucial for improving naturalness. A probabilistic framework is proposed for selecting units based on an acoustic framework. Reducing the variability in acoustic context improves both naturalness and intelligibility. Since the context is only specified by acoustics, it can be applied to any language and perhaps even multilingual synthesis. The proposed approach has been tested on 2 Indian languages. An improvement of up to 21.9% in DMOS and 73.93% in WER relative to the conventional system that uses linguistic criteria is observed.
Keywords :
natural language processing; speech synthesis; DMOS; Indian languages; WER; acoustic context; acoustic similarity; linguistic context; multilingual synthesis; text-to-speech synthesisers; unit selection synthesisers; Context; Databases; Pragmatics; Speech; Speech synthesis; Synthesizers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location :
Kanpur
Type :
conf
DOI :
10.1109/NCC.2014.6811333
Filename :
6811333
Link To Document :
بازگشت