مرکز منطقه ای اطلاع رساني علوم و فناوري - Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data

DocumentCode :

3163697

Title :

Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data

Author :

Heigold, Georg ; Nguyen, Patrick ; Weintraub, Mitchel ; Vanhoucke, Vincent

Author_Institution :

Google Inc., Mountain View, CA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4437

Lastpage :

4440

Abstract :

The acoustic models in state-of-the-art speech recognition systems are based on phones in context that are represented by hidden Markov models. This modeling approach may be limited in that it is hard to incorporate long-span acoustic context. Exemplar-based approaches are an attractive alter-native, in particular if massive data and computational power are available. Yet, most of the data at Google are unsupervised and noisy. This paper investigates an exemplar-based approach under this yet not well understood data regime. A log-linear rescoring framework is used to combine the exemplar-based features on the word level with the first-pass model. This approach guarantees at least baseline performance and focuses on the refined modeling of words with sufficient data. Experimental results for the Voice Search and the YouTube tasks are presented.

Keywords :

hidden Markov models; speech recognition; Voice Search; YouTube task; acoustic model; exemplar-based feature; hidden Markov model; log-linear rescoring framework; noisy data; speech recognition; unsupervised data; Acoustics; Context; Hidden Markov models; Speech recognition; Training; Training data; YouTube; Exemplar-based speech recognition; conditional random fields; speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288904

Filename :

6288904

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3163697