DocumentCode :
271975
Title :
Utterance copy through analysis-by-synthesis using genetic algorithm
Author :
Araújo, Fabíola ; Klautau, Aldebaro
Author_Institution :
Signal Process. Lab., Fed. Univ. of Para, Belem, Brazil
fYear :
2014
fDate :
17-20 Aug. 2014
Firstpage :
1
Lastpage :
5
Abstract :
Synthesizing artificial voices with the aim of performing an utterance copy is considered a difficult task, even when using a formant-based synthesizer. One reason is that several input parameter values, which have a nonlinear relation to the output, must be adjusted to produce even a short segment of voice. Furthermore, the naturalness and intelligibility of the artificial voice are aspects to be considered when producing and evaluating the synthesized voice. The main goal of this work is to present the current status of our system, based on a genetic algorithm, that automatically estimates the input parameters of the Klatt synthesizer, and to compare its results with those obtained with Winsnoori, the only available software that performs the same task. The proposed system outperforms the baseline by a large margin. For example, on average, the mean square error is reduced to 9.45% of the values obtained with Winsnoori, and the PESQ increased from 1.06 to 3.36.
Keywords :
genetic algorithms; mean square error methods; speech intelligibility; speech synthesis; voice equipment; Klatt synthesizer; analysis-by-synthesis; artificial voice intelligibility; artificial voice naturalness; artificial voice synthesis; genetic algorithm; mean square error method; utterance copy; Biological cells; Genetic algorithms; Indexes; Sociology; Speech; Statistics; Synthesizers;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Telecommunications Symposium (ITS), 2014 International
Conference_Location :
Sao Paulo
Type :
conf
DOI :
10.1109/ITS.2014.6948053
Filename :
6948053