مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

3167453

Title :

Japanese and Korean voice search

Author :

Schuster, Mike ; Nakajima, Kaisuke

Author_Institution :

Google Inc., Mountain View, CA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

5149

Lastpage :

5152

Abstract :

This paper describes challenges and solutions for building a successful voice search system as applied to Japanese and Korean at Google. We describe the techniques used to deal with an infinite vocabulary, how modeling completely in the written domain for language model and dictionary can avoid some system complexity, and how we built dictionaries, language and acoustic models in this framework. We show how to deal with the difficulty of scoring results for multiple script languages because of ambiguities. The development of voice search for these languages led to a significant simplification of the original process to build a system for any new language which in in parts became our default process for internationalization of voice search.

Keywords :

natural language processing; speech recognition; Japanese voice search system; Korean voice search system; acoustic models; dictionary; language model; multiple script languages; speech recognition; Decision support systems; Helium; Japanese; Korean; Speech recognition; voice search;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6289079

Filename :

6289079

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3167453