مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition

DocumentCode :

2175063

Title :

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition

Author :

Cui, Xiaodong ; Huang, Jing ; Chien, Jen-Tzung

Author_Institution :

IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

4668

Lastpage :

4671

Abstract :

Current hidden Markov acoustic modeling for large vocabulary continuous speech recognition (LVCSR) relies on the availability of abundant labeled transcriptions. Given that speech labeling is both expensive and time-consuming while there is a huge amount of unlabeled data easily available nowadays, semi-supervised learning (SSL) from both labeled and unlabeled data which aims to reduce the development cost for LVCSR becomes more important than ever. In this paper, we propose SSL for LVCSR by using the multiple views learned from different acoustic features and randomized decision trees. In addition, we develop the multi-objective learning of HMM-based acoustic models by optimizing a hybrid criterion which is established by the combination of the discriminative mutual information from labeled data and the entropy from unlabeled data. Experiments conducted on Broadcast News show the benefits of proposed methods.

Keywords :

hidden Markov models; speech recognition; HMM-based acoustic models; LVCSR; SSL; broadcast news; large vocabulary continuous speech recognition; multiobjective semisupervised learning; Decision trees; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Training; Vegetation; LVCSR; discriminative training; multi-objective learning; multi-view; semi-supervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947396

Filename :

5947396

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2175063