Title :
YAST: A Scalable ASR Toolkit Especially Designed for Under-Resourced Languages
Author :
Ferreira, Eija ; Nocera, P. ; Goudi, M. ; Thi, N.D.D.
Author_Institution :
Lab. Inf. d´´Avignon, LIA, Avignon, France
Abstract :
The ability to collect and process a large amount of resources (e.g. vocabularies, text corpora, transcribed speech corpora and phonetic dictionaries) constitutes a critical prerequisite of systems based on statistical methods. This aspect becomes crucial for languages presenting a lack of computer resources, also known as under-resourced languages, such as Vietnamese. Our work consists in exploring an efficient methodology which can help the development of speech recognition systems for this kind of languages. This article presents a possible solution that provides a fast building and customisable ASR toolkit, called YAST. The latter includes an ASR library as well as a collection of C++/Java executable programs and some helper bash and perl scripts. These utilities allow on one hand, to build and evaluate an ASR system, on the other, to provide programming development hooks that permit to include state of the art techniques. YAST is freely available for non-commercial purposes. This paper summarizes the functionality of the toolkit and also provides a basic example carried out on the Vietnamese language.
Keywords :
Java; natural language processing; speech recognition; statistical analysis; text analysis; ASR library; ASR toolkit; C++ executable programs; Java executable programs; Vietnamese language; YAST; automatic speech recognition systems; phonetic dictionaries; programming development hooks; statistical methods; text corpora; transcribed speech corpora; under-resourced languages; vocabularies; Acoustics; Buildings; Context modeling; Estimation; Hidden Markov models; Speech; Speech recognition; SPEERAL; Speech recognition; YAST; fast building ASR; pi-project; under-resourced languages;
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
DOI :
10.1109/IALP.2012.65