DocumentCode :
3109704
Title :
Thai ASR development for network-based speech translation
Author :
Wutiwiwatchai, Chai ; Thangthai, K. ; Sertsi, P.
Author_Institution :
Nat. Electron. & Comput. Technol. Center (NECTEC), Pathumthani, Thailand
fYear :
2012
fDate :
9-12 Dec. 2012
Firstpage :
92
Lastpage :
96
Abstract :
A network-based multilingual speech translation service under the Universal Speech Translation Advanced Research (U-STAR) consortium requires a well-tuned Thai automatic speech recognition (ASR) service. This paper summarizes the development of the service by utilizing both Thai read-speech and telephone speech (LOTUS-CELL 2.0) corpora. Tuning is performed regarding different sets of acoustic unit and training data. An evaluation shows that the recognition accuracy of ASR working over data channels can be improved by using the LOTUS-CELL 2.0 corpus although the corpus was constructed via voice channels. The problem of Named-entity (NE) words often found in the working domain is obvious and leads to an urgent future work.
Keywords :
language translation; natural language processing; speech recognition; ASR; LOTUS-CELL 2.0; NE; Thai ASR development; Thai automatic speech recognition service; Thai read-speech; U-STAR; named-entity words; network-based multilingual speech translation service; telephone speech; universal speech translation advanced research; Acoustics; Adaptation models; Data models; Hidden Markov models; Mobile handsets; Speech; Speech recognition; Thai ASR; mobile speech; speech translation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2811-1
Electronic_ISBN :
978-1-4673-2812-8
Type :
conf
DOI :
10.1109/ICSDA.2012.6422477
Filename :
6422477
Link To Document :
بازگشت