Invited Talks

Author

Furui, S.

Author_Institution

Tokyo Inst. of Technol., Tokyo, Japan

fYear

2011

fDate

15-17 Nov. 2011

Abstract

More than 6000 living languages are spoken in the world today, and the majority of them are concentrating in Asia. Every language has its own specific acoustic as well as linguistic characteristics that require special modeling techniques. This talk presents our recent experiences in regard to building automatic speech recognition (ASR) systems for the Indonesian, Thai and Chinese languages. For Indonesian, we are building a spoken-query information retrieval (IR) system. In order to solve the problem of a large variation of proper noun and English word pronunciation, we have applied proper noun-specific adaptation in acoustic modeling and rule-based English- to-Indonesian phoneme mapping. For Thai, since there is no word boundary in the written form, we have proposed a new method for automatically creating word-like units from a text corpus, and to recognize spoken style utterances we have applied topic and speaking style adaptation to the language model. In spoken Chinese, long organization names are often abbreviated, and abbreviated utterances cannot be recognized if the abbreviations are not included in the dictionary. We have proposed a new method for automatically generating Chinese abbreviations, and by expanding the vocabulary using the generated abbreviations, we have significantly improved the performance of voice search. This talk includes several recent research activities for the Japanese language.

Keywords

natural language processing; speech recognition; ASR research; Asian languages; English word pronunciation; acoustic modeling; automatic speech recognition systems; linguistic characteristics; special modeling techniques; spoken query information retrieval system;

fLanguage

English

Publisher

ieee

Conference_Titel

Asian Language Processing (IALP), 2011 International Conference on

Conference_Location

Penang

Print_ISBN

978-1-4577-1733-8

Type

conf

DOI

10.1109/IALP.2011.9

Filename

6121453