مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised acoustic model training: Comparing South African English and isiZulu

DocumentCode :

3714224

Title :

Unsupervised acoustic model training: Comparing South African English and isiZulu

Author :

Neil Kleynhans;Febe de Wet;Etienne Barnard

Author_Institution :

Multilingual Speech Technologies, North-West University, Vanderbijlpark, South Africa

fYear :

2015

Firstpage :

136

Lastpage :

141

Abstract :

Large amounts of untranscribed audio data are generated every day. These audio resources can be used to develop robust acoustic models that can be used in a variety of speech-based systems. Manually transcribing this data is resource intensive and requires funding, time and expertise. Lightly-supervised training techniques, however, provide a means to rapidly transcribe audio, thus reducing the initial resource investment to begin the modelling process. Our findings suggest that the lightly-supervised training technique works well for English but when moving to an agglutinative language, such as isiZulu, the process fails to achieve the performance seen for English. Additionally, phone-based performances are significantly worse when compared to an approach using word-based language models. These results indicate a strong dependence on large or well-matched text resources for lightly-supervised training techniques.

Keywords :

"Training","Acoustics","Data models","Hidden Markov models","Speech","Dictionaries","Speech recognition"

Publisher :

ieee

Conference_Titel :

Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2015

Type :

conf

DOI :

10.1109/RoboMech.2015.7359512

Filename :

7359512

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3714224