• DocumentCode
    3409394
  • Title

    Assigning gene ontology categories (GO) to yeast genes using text-based supervised learning methods

  • Author

    Izumitani, Tomonori ; Taira, Hirotoshi ; Kazawa, Hideto ; Maeda, Eisaku

  • Author_Institution
    NTT Commun. Sci. Labs., Kyoto, Japan
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    503
  • Lastpage
    504
  • Abstract
    We propose a method for assigning upper level gene ontology terms (GO categories) to genes using relevant documents. This method represents each gene as a vector using relevant documents to the gene. Then, binary classifiers are made for the GO categories using such supervised learning methods as support vector machines and maximum entropy method. We applied this method for assigning GO categories to yeast genes and achieved an average F-measure of 0.67, which is > 0.3 higher than the existing method developed by Raychaudhun et al. We also applied this method to genome-wide annotation for yeast by all GO Slim categories provided by SGD and achieved average F-measures of 0.58, 0.72, and 0.60, respectively, for the three GO parts: cellular component, molecular function, and biological process.
  • Keywords
    biology computing; cellular biophysics; entropy; genetics; learning (artificial intelligence); molecular biophysics; support vector machines; Saccharomyces genome database; binary classifiers; biological process; cellular component; gene ontology Slim categories; genome-wide annotation; maximum entropy method; molecular function; support vector machines; text-based supervised learning; yeast genes; Bioinformatics; Biological processes; Databases; Entropy; Fungi; Genomics; Ontologies; Supervised learning; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332476
  • Filename
    1332476