• DocumentCode
    1230769
  • Title

    Applying automatically derived gene-groups to automatically predict and refine metabolic pathways

  • Author

    Bansal, Arvind K. ; Woolverton, Christopher J.

  • Author_Institution
    Dept. of Comput. Sci., Kent State Univ., OH, USA
  • Volume
    15
  • Issue
    4
  • fYear
    2003
  • Firstpage
    883
  • Lastpage
    894
  • Abstract
    This paper describes an automated technique to predict integrated pathways and refine existing metabolic pathways using the information of automatically derived, functionally similar gene-groups and orthologs (functionally equivalent genes) derived by the comparison of complete microbial genomes archived in GenBank. The described method integrates automatically derived orthologous and homologous gene-groups (http://www.mcs.kent.edu/∼arvind/orthos.html) with the biochemical pathway template available at the KEGG database (http://www.genome.ad.jp), the enzyme information derived from the SwissProt enzyme database (http:// expasys.hcuge.ch/), and the Ligand database (http://www.genome.ad.jp). The technique refines existing pathways (based upon the network of reactions of enzymes) by associating corresponding nonenzymatic and regulatory proteins to enzymes and operons and by identifying substituting homologs. The technique is suitable for building and refining integrated pathways using evolutionary diverse organisms. A methodology and the corresponding algorithm are presented. The technique is illustrated by comparing the genomes of E coli and B. subtilis with M. tuberculosis. The findings about integrated pathways are briefly discussed.
  • Keywords
    biology computing; genetics; scientific information systems; GenBank; KEGG database; Ligand database; SwissProt enzyme database; automatically derived gene-groups; biochemical pathway template; complete microbial genomes; evolutionary diverse organisms; homologous gene-groups; metabolic pathway prediction; metabolic pathway refinement; orthologs; regulatory proteins; Biochemistry; Bioinformatics; Databases; Energy management; Genomics; Machinery; Microorganisms; Organisms; Pathogens; Proteins;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2003.1209006
  • Filename
    1209006