Title :
A paired test for recognizer selection with untranscribed data
Author :
Raj, Bhiksha ; Singh, Rita ; Baker, James
Author_Institution :
Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Traditionally, the use of untranscribed speech has been restricted to unsupervised or semi-supervised training of acoustic models. Comparison of recognizers has required labeled data. In this paper we show how recognizers may be rank-ordered in terms of their performance using only a large quantity of untranscribed data, given a third "reference" recognizer. We develop statistical tests for comparing recognizers in this scenario. The accuracy of the reference system need not be known. Also, while the accuracy of the reference system affects the amount of data required, with enough data it only needs to perform better than chance. We show through detailed experiments that the rank ordering predicted from untranscribed data is indeed correct.
Keywords :
speech recognition; Speech recognition; acoustic models; recognizer selection paired test; statistical tests; third reference recognizer; untranscribed data; Accuracy; Adaptation models; Data models; Hidden Markov models; Joints; Speech recognition; Training; Hypothesis testing; Speech recognition; Unsupervised learning; Untranscribed data;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947648