Title :
Random Forests-Based Confidence Annotation Using Novel Features from Confusion Network
Author :
Xue, Jian ; Zhao, Yunxin
Author_Institution :
Dept. of Comput. Sci., Missouri Univ., Columbia, MO
Abstract :
In this paper, we propose a set of new features for confidence annotation, including three features derived from confusion network and one from statistical significance test. We also propose using random forests as confidence classifier. The new features are combined with a set of eight previously proposed confidence features, and the random forests is compared with decision tree and support vector machine. Experiments were conducted on telehealth captioning task with a vocabulary size of 46,489. Average confidence annotation accuracy of 84.69% was achieved on 5 doctors´ test set. In addition, random forests was shown useful for feature importance ranking. The proposed features are shown important in confidence annotation and random forests achieved best results among the three classifiers
Keywords :
decision trees; speech processing; speech recognition; statistical testing; support vector machines; telemedicine; confidence classifier; confusion network; decision tree; random forests-based confidence annotation; statistical significance test; support vector machine; telehealth captioning task; Classification tree analysis; Computer science; Decision trees; Entropy; Heart; Lattices; Natural languages; Support vector machine classification; Support vector machines; Testing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660229