• DocumentCode
    2641163
  • Title

    Biological terms boundary identification by maximum entropy model

  • Author

    Wang, Jian ; Shao, Wenwu ; Zhu, Fei

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
  • fYear
    2011
  • fDate
    21-23 June 2011
  • Firstpage
    2446
  • Lastpage
    2448
  • Abstract
    There are a large number of biological data which are produced by life science experiments. How to use these data to carry out life discussion effectively supported by mathematics, computer science is a significant problem. Biological terms identification is one of the important research issues in the area of Bioinformatics. Besides, Maximum entropy model is widely used in various fields. This noun sounds profound, but its principle is very simple. As a statistical method, it has many features: for instance, subtle features can be controlled and reusable, it is also understood easily and so on. This model was first introduced in the sentence segmentation. In this paper, an example of the introduction of the concept of maximum entropy model, about the maximum entropy model was applied to Biological text terms boundary identification. Additionally, compared to the general terms boundary identification to the ME model, to illustrate the advantages of the introduction of maximum entropy model.
  • Keywords
    bioinformatics; maximum entropy methods; statistical analysis; text analysis; bioinformatics; biological text terms boundary identification; life science experiments; maximum entropy model; sentence segmentation; statistical method; Biological information theory; Biological system modeling; Computational modeling; Dictionaries; Entropy; Proteins; Biological text; biological text mining; maximum entropy model; terms boundary identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Electronics and Applications (ICIEA), 2011 6th IEEE Conference on
  • Conference_Location
    Beijing
  • ISSN
    pending
  • Print_ISBN
    978-1-4244-8754-7
  • Electronic_ISBN
    pending
  • Type

    conf

  • DOI
    10.1109/ICIEA.2011.5976003
  • Filename
    5976003