• DocumentCode
    2992896
  • Title

    Application of SVM in Citation Information Extraction

  • Author

    Liang, Jiguang ; Layton, Robert ; Wang, Wei

  • Author_Institution
    Dept. of Educ. Technol., Nanjing Normal Univ., Nanjing, China
  • fYear
    2011
  • fDate
    24-28 Sept. 2011
  • Firstpage
    33
  • Lastpage
    35
  • Abstract
    Support Vector Machines are an effective form of binary-class classification algorithm. To enhance the utilization of text structural features for information extraction, which are greatly restricted by the Hidden Markov Model (HMM), this paper proposes a support vector machine multi-class classification based on Markov properties to extract the information from a citation database. The proposed model extracts symbol characteristics as features and composes a binary tree of the transition probabilities. Experiments show that the proposed method outperforms HMM and basic SVM methods.
  • Keywords
    citation analysis; classification; hidden Markov models; support vector machines; text analysis; Markov properties; SVM; binary tree; binary-class classification algorithm; citation database; citation information extraction; hidden Markov model; multiclass classification; support vector machine; symbol characteristics; text structural feature; transition probabilities; Binary trees; Data mining; Feature extraction; Hidden Markov models; Markov processes; Probability; Support vector machines; Support Vector Machine (SVM); classification; feature extraction; probability; symbol feature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Complexity and Data Mining (IWCDM), 2011 First International Workshop on
  • Conference_Location
    Nanjing, Jiangsu
  • Print_ISBN
    978-1-4577-2007-9
  • Type

    conf

  • DOI
    10.1109/IWCDM.2011.15
  • Filename
    6128411