DocumentCode :
454730
Title :
Random Forests-Based Confidence Annotation Using Novel Features from Confusion Network
Author :
Xue, Jian ; Zhao, Yunxin
Author_Institution :
Dept. of Comput. Sci., Missouri Univ., Columbia, MO
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
In this paper, we propose a set of new features for confidence annotation, including three features derived from confusion network and one from statistical significance test. We also propose using random forests as confidence classifier. The new features are combined with a set of eight previously proposed confidence features, and the random forests is compared with decision tree and support vector machine. Experiments were conducted on telehealth captioning task with a vocabulary size of 46,489. Average confidence annotation accuracy of 84.69% was achieved on 5 doctors´ test set. In addition, random forests was shown useful for feature importance ranking. The proposed features are shown important in confidence annotation and random forests achieved best results among the three classifiers
Keywords :
decision trees; speech processing; speech recognition; statistical testing; support vector machines; telemedicine; confidence classifier; confusion network; decision tree; random forests-based confidence annotation; statistical significance test; support vector machine; telehealth captioning task; Classification tree analysis; Computer science; Decision trees; Entropy; Heart; Lattices; Natural languages; Support vector machine classification; Support vector machines; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660229
Filename :
1660229
Link To Document :
بازگشت