DocumentCode :
3714231
Title :
HMM Adaptation for child speech synthesis using ASR data
Author :
Avashna Govender;Bakari Nouhou;Febe de Wet
Author_Institution :
Human Language Technology Research Group at the CSIR in Pretoria, South Africa
fYear :
2015
Firstpage :
178
Lastpage :
183
Abstract :
Acquiring large amounts of child speech data is a particularly difficult task. One could therefore consider the possibility to add existing corpora of child speech data to the severely limited resources that are available for developing child voices. This paper reports on a feasibility study that was conducted to determine whether it is possible to synthesize good quality child voices using child speech data that was recorded for automatic speech recognition (ASR) purposes. A text-to-speech system was implemented using hidden Markov model based synthesis since it has proven to be a technique that is less susceptible to imperfect data. The paper describes how data was selected from the ASR corpus to build various child voices. The voices were evaluated to determine whether the data selection methods yield acceptable results within the context of model adaptation for child speech synthesis. The results show that, if data is selected according to particular criteria, ASR data could be used to develop child voices that are comparable to voices that were built using speech data specifically recorded for speech synthesis purposes.
Keywords :
"Speech","Hidden Markov models","Adaptation models","Data models","Speech synthesis","Noise measurement","Speech recognition"
Publisher :
ieee
Conference_Titel :
Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2015
Type :
conf
DOI :
10.1109/RoboMech.2015.7359519
Filename :
7359519
Link To Document :
بازگشت