DocumentCode :
2909689
Title :
Invited Talks
Author :
Furui, S.
Author_Institution :
Tokyo Inst. of Technol., Tokyo, Japan
fYear :
2011
fDate :
15-17 Nov. 2011
Abstract :
More than 6000 living languages are spoken in the world today, and the majority of them are concentrating in Asia. Every language has its own specific acoustic as well as linguistic characteristics that require special modeling techniques. This talk presents our recent experiences in regard to building automatic speech recognition (ASR) systems for the Indonesian, Thai and Chinese languages. For Indonesian, we are building a spoken-query information retrieval (IR) system. In order to solve the problem of a large variation of proper noun and English word pronunciation, we have applied proper noun-specific adaptation in acoustic modeling and rule-based English- to-Indonesian phoneme mapping. For Thai, since there is no word boundary in the written form, we have proposed a new method for automatically creating word-like units from a text corpus, and to recognize spoken style utterances we have applied topic and speaking style adaptation to the language model. In spoken Chinese, long organization names are often abbreviated, and abbreviated utterances cannot be recognized if the abbreviations are not included in the dictionary. We have proposed a new method for automatically generating Chinese abbreviations, and by expanding the vocabulary using the generated abbreviations, we have significantly improved the performance of voice search. This talk includes several recent research activities for the Japanese language.
Keywords :
natural language processing; speech recognition; ASR research; Asian languages; English word pronunciation; acoustic modeling; automatic speech recognition systems; linguistic characteristics; special modeling techniques; spoken query information retrieval system;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1733-8
Type :
conf
DOI :
10.1109/IALP.2011.9
Filename :
6121453
Link To Document :
بازگشت