Author :
Sagae, K.; Lehr, M.; Prud'hommeaux, E.; Xu, P.; Glenn, N.; Karakos, D.; Khudanpur, S.; Roark, B.; Saraçlar, M.; Shafran, I.; Bikel, D.; Callison-Burch, C.; Cao, Y.; Hall, K.; Hasler, E.; Koehn, P.; Lopez, A.; Post, M.; Riley, D.
Abstract :
This paper investigates semi-supervised methods for discriminative language modeling, whereby n-best lists are “hallucinated” for given reference text and then used to train n-gram language models with the perceptron algorithm. We perform controlled experiments on a very strong baseline English conversational telephone speech (CTS) system, comparing three methods for simulating ASR output, and compare the results against training on “real” n-best lists produced by the baseline recognizer. We find that methods based on extracting phrasal cohorts, similar to the phrase-table extraction methods used in machine translation, yielded the largest gains of our three methods, achieving over half of the WER reduction obtained with fully supervised training.
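For illustration, a minimal sketch of the training loop the abstract describes, assuming a hypothetical `cohorts` table mapping a word to acoustically confusable alternatives (the paper extracts phrasal cohorts much as machine translation extracts phrase tables, so whole-word substitution here is a simplification), bigram count features, and perceptron updates toward the reference transcript; real systems typically update toward the lowest-WER hypothesis in the list and combine the learned score with the baseline recognizer score:

```python
import random
from collections import Counter

def hallucinate_nbest(reference, cohorts, k=10):
    """Simulate ASR output for a reference transcript by sampling
    confusable substitutions from the (hypothetical) cohort table."""
    return [[random.choice(cohorts.get(w, [w])) for w in reference]
            for _ in range(k)]

def ngram_features(tokens, n=2):
    """Count the n-grams (bigrams by default) in a token sequence."""
    feats = Counter()
    for i in range(len(tokens) - n + 1):
        feats[tuple(tokens[i:i + n])] += 1
    return feats

def perceptron_train(references, cohorts, epochs=5):
    """Train n-gram feature weights with the structured perceptron
    over hallucinated n-best lists."""
    weights = Counter()
    for _ in range(epochs):
        for ref in references:
            nbest = hallucinate_nbest(ref, cohorts)
            # Current 1-best hypothesis under the learned score.
            best = max(nbest, key=lambda h: sum(
                weights[f] * c for f, c in ngram_features(h).items()))
            if best != ref:
                # Standard perceptron update: reward the reference's
                # n-grams, penalize the wrongly preferred hypothesis's.
                weights.update(ngram_features(ref))
                weights.subtract(ngram_features(best))
    return weights

# Toy usage with an invented cohort table:
cohorts = {"their": ["their", "there"], "two": ["two", "to", "too"]}
refs = [["their", "two", "dogs"], ["go", "to", "the", "store"]]
weights = perceptron_train(refs, cohorts)
```

In practice an averaged perceptron is usually preferred for stability, and at decoding time the learned discriminative score is interpolated with the baseline ASR model score rather than used alone.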
Keywords :
language translation; natural language processing; English CTS system; baseline recognizer; discriminative language modeling; hallucinated n-best lists; machine translation; perceptron algorithm; reference text; simulating ASR output; training n-gram language models; Data models; Hidden Markov models; Speech; Speech recognition; Training; Training data; Transducers; automatic speech recognition; discriminative training; language modeling; semi-supervised methods