DocumentCode :
3784923
Title :
Boosted learning in dynamic Bayesian networks for multimodal speaker detection
Author :
A. Garg;V. Pavlovic;J.M. Rehg
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Volume :
91
Issue :
9
fYear :
2003
Firstpage :
1355
Lastpage :
1369
Abstract :
Bayesian network models provide an attractive framework for multimodal sensor fusion. They combine an intuitive graphical representation with efficient algorithms for inference and learning. However, the unsupervised nature of standard parameter learning algorithms for Bayesian networks can lead to poor performance in classification tasks. We have developed a supervised learning framework for Bayesian networks, which is based on the Adaboost algorithm of Schapire and Freund. Our framework covers static and dynamic Bayesian networks with both discrete and continuous states. We have tested our framework in the context of a novel multimodal HCI application: a speech-based command and control interface for a Smart Kiosk. We provide experimental evidence for the utility of our boosted learning approach.
Keywords :
"Intelligent networks","Bayesian methods","Multimodal sensors","Supervised learning","Testing","Human computer interaction","Command and control systems","Boosting","User interfaces","Speech"
Journal_Title :
Proceedings of the IEEE
Publisher :
ieee
ISSN :
0018-9219
Type :
jour
DOI :
10.1109/JPROC.2003.817119
Filename :
1230214
Link To Document :
بازگشت