مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning non-parametric models of pronunciation

DocumentCode :

2176496

Title :

Learning non-parametric models of pronunciation

Author :

Hutchinson, Brian ; Droppo, Jasha

Author_Institution :

Electr. Eng. Dept., Univ. of Washington, Seattle, WA, USA

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

4904

Lastpage :

4907

Abstract :

As more data becomes available for a given speech recognition task, the natural way to improve recognition accuracy is to train larger models. But, while this strategy yields modest improvements to small systems, the relative gains diminish as the data and models grow. In this paper, we demonstrate that abundant data allows us to model patterns and structure that are unaccounted for in standard systems. In particular, we model the systematic mismatch between the canonical pronunciations of words and the actual pronunciations found in casual or accented speech. Using a combination of two simple data-driven pronunciation models, we can correct 5.2% of the errors in our mobile voice search application.

Keywords :

speech recognition; accented speech; canonical pronunciations; casual speech; data-driven pronunciation models; mobile voice search application; nonparametric models learning; speech recognition task; Acoustics; Context; Context modeling; Data models; Hidden Markov models; Parametric statistics; Speech recognition; Pronunciation model; casual speech; non-parametric model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947455

Filename :

5947455

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2176496