مرکز منطقه ای اطلاع رساني علوم و فناوري - HMM Adaptation for child speech synthesis using ASR data

DocumentCode :

3714231

Title :

HMM Adaptation for child speech synthesis using ASR data

Author :

Avashna Govender;Bakari Nouhou;Febe de Wet

Author_Institution :

Human Language Technology Research Group at the CSIR in Pretoria, South Africa

fYear :

2015

Firstpage :

178

Lastpage :

183

Abstract :

Acquiring large amounts of child speech data is a particularly difficult task. One could therefore consider the possibility to add existing corpora of child speech data to the severely limited resources that are available for developing child voices. This paper reports on a feasibility study that was conducted to determine whether it is possible to synthesize good quality child voices using child speech data that was recorded for automatic speech recognition (ASR) purposes. A text-to-speech system was implemented using hidden Markov model based synthesis since it has proven to be a technique that is less susceptible to imperfect data. The paper describes how data was selected from the ASR corpus to build various child voices. The voices were evaluated to determine whether the data selection methods yield acceptable results within the context of model adaptation for child speech synthesis. The results show that, if data is selected according to particular criteria, ASR data could be used to develop child voices that are comparable to voices that were built using speech data specifically recorded for speech synthesis purposes.

Keywords :

"Speech","Hidden Markov models","Adaptation models","Data models","Speech synthesis","Noise measurement","Speech recognition"

Publisher :

ieee

Conference_Titel :

Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2015

Type :

conf

DOI :

10.1109/RoboMech.2015.7359519

Filename :

7359519

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3714231