DocumentCode
588862
Title
Analyzing Sequence Data Based on Conditional Random Fields with Co-training
Author
Leilei Yang ; Guiquan Liu ; Qi Liu ; Lei Zhang ; Enhong Chen
Author_Institution
Univ. of Sci. & Technol. of China, Hefei, China
fYear
2012
fDate
17-18 Nov. 2012
Firstpage
94
Lastpage
98
Abstract
Sequence data plays an important role in data analysis applications, such as sequence classification. One important aspect of sequence data analysis is to obtain the labeled sequence data and use a machine learning model to predict the sequence structures. Conditional Random Fields (CRF) is such a machine learning method which is popular used in sequential data analysis. This is because that CRF can effectively capture the data correlations in context with abundant training data. However, in real applications, the labeled training data is usually difficult to be collected. In order to reduce the requirement of the amount of the labeled training data, a novel model is proposed named Conditional Random Fields with Co-training (Co-CRF). The Co-CRF model can work well even on the reduced labeled training data. Empirical results show that Co-CRF can produce a more accurate analysis than the traditional CRF, especially with very limited training data.
Keywords
data analysis; learning (artificial intelligence); pattern classification; random processes; Co-CRF model; conditional random field; data correlation; labeled sequence data; machine learning model; sequence classification; sequence data analysis; sequence structure prediction; Accuracy; Analytical models; Correlation; Data models; Hidden Markov models; Training; Training data; Classification; Co-training; Conditional Random Fields; Sequence Data;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Security (CIS), 2012 Eighth International Conference on
Conference_Location
Guangzhou
Print_ISBN
978-1-4673-4725-9
Type
conf
DOI
10.1109/CIS.2012.29
Filename
6405874
Link To Document