DocumentCode
3301499
Title
A confusion network based confidence measure for active learning in speech recognition
Author
Chen, Wei ; Liu, Gang ; Guo, Jun
Author_Institution
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing
fYear
2008
fDate
19-22 Oct. 2008
Firstpage
1
Lastpage
6
Abstract
Speech recognition systems are usually trained using tremendous transcribed utterances, and training data preparation is intensively time-consuming and costly. Aiming at reducing the number of training examples to be labeled, active learning is used in acoustic modeling of speech recognition, this learning scheme iteratively inspects the unlabeled samples, selects the most informative samples corresponding to a certain criterion, then annotates them, and adds the newly transcribed samples to the training set to update the acoustic model. Concerning about the importance of the criterion to select the most informative samples, we proposed a confidence measure computed by confusion network, and used this measure as the criterion for sample selection to improve the efficiency of active learning in acoustic modeling. Our experiments show that active learning, which adopts the proposed confidence measure, can achieve 31% maximum reduction of labeled data compared with random selection method.
Keywords
learning (artificial intelligence); speech recognition; acoustic modeling; active learning; confidence measure; confusion network; random selection method; speech recognition systems; training data preparation; transcribed utterances; Acoustic measurements; Automatic speech recognition; Convergence; Error analysis; Hidden Markov models; Intelligent systems; Laboratories; Pattern recognition; Speech recognition; Training data; Active learning; Confidence Measure; Confusion network;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-4515-8
Electronic_ISBN
978-1-4244-2780-2
Type
conf
DOI
10.1109/NLPKE.2008.4906813
Filename
4906813
Link To Document