Title :
K-Similar Conditional Random Fields for Semi-supervised Sequence Labeling
Author :
Chen, Xi ; Chen, Shihong ; Xiao, Kun
Author_Institution :
Comput. Sch., Wuhan Univ., Wuhan
Abstract :
Sequence labeling tasks, such as named entity recognition and part of speech tagging, are the fundamental compositions of the information extraction system, and thus received attentions these years. This paper proposes k-similar conditional random fields for semi-supervised sequence labeling, and makes use of unlabeled data to calculate the similarity between words with distributional clustering. The named entity recognition experiments show that this method can improve the performance through unlabeled data.
Keywords :
knowledge acquisition; random processes; K-similar conditional random field; distributional clustering; information extraction system; named entity recognition; semisupervised sequence labeling; Data mining; Entropy; Hidden Markov models; Inference algorithms; Information technology; Labeling; Natural language processing; Semisupervised learning; Speech recognition; Tagging;
Conference_Titel :
Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
Conference_Location :
Dalian Liaoning
Print_ISBN :
978-0-7695-3273-8
DOI :
10.1109/ALPIT.2008.16