DocumentCode :
2181317
Title :
A paired test for recognizer selection with untranscribed data
Author :
Raj, Bhiksha ; Singh, Rita ; Baker, James
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5676
Lastpage :
5679
Abstract :
Traditionally, the use of untranscribed speech has been restricted to unsupervised or semi-supervised training of acoustic models. Comparison of recognizers has required labeled data. In this paper we show how recognizers may be rank-ordered in terms of their performance using only a large quantity of untranscribed data, given a third "reference" recognizer. We develop statistical tests for comparing recognizers in this scenario. The accuracy of the reference system need not be known. Also, while the accuracy of the reference system affects the amount of data required, with enough data it only needs to perform better than chance. We show through detailed experiments that the rank ordering predicted from untranscribed data is indeed correct.
Keywords :
speech recognition; Speech recognition; acoustic models; recognizer selection paired test; statistical tests; third reference recognizer; untranscribed data; Accuracy; Adaptation models; Data models; Hidden Markov models; Joints; Speech recognition; Training; Hypothesis testing; Speech recognition; Unsupervised learning; Untranscribed data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947648
Filename :
5947648
Link To Document :
بازگشت