DocumentCode
2992896
Title
Application of SVM in Citation Information Extraction
Author
Liang, Jiguang ; Layton, Robert ; Wang, Wei
Author_Institution
Dept. of Educ. Technol., Nanjing Normal Univ., Nanjing, China
fYear
2011
fDate
24-28 Sept. 2011
Firstpage
33
Lastpage
35
Abstract
Support Vector Machines are an effective form of binary-class classification algorithm. To enhance the utilization of text structural features for information extraction, which are greatly restricted by the Hidden Markov Model (HMM), this paper proposes a support vector machine multi-class classification based on Markov properties to extract the information from a citation database. The proposed model extracts symbol characteristics as features and composes a binary tree of the transition probabilities. Experiments show that the proposed method outperforms HMM and basic SVM methods.
Keywords
citation analysis; classification; hidden Markov models; support vector machines; text analysis; Markov properties; SVM; binary tree; binary-class classification algorithm; citation database; citation information extraction; hidden Markov model; multiclass classification; support vector machine; symbol characteristics; text structural feature; transition probabilities; Binary trees; Data mining; Feature extraction; Hidden Markov models; Markov processes; Probability; Support vector machines; Support Vector Machine (SVM); classification; feature extraction; probability; symbol feature;
fLanguage
English
Publisher
ieee
Conference_Titel
Complexity and Data Mining (IWCDM), 2011 First International Workshop on
Conference_Location
Nanjing, Jiangsu
Print_ISBN
978-1-4577-2007-9
Type
conf
DOI
10.1109/IWCDM.2011.15
Filename
6128411
Link To Document