• DocumentCode
    2516141
  • Title

    Biological Feature Incorporated Alignment for Cross Species Analysis on Carbohydrate Binding Modules

  • Author

    Chou, Wei-Yao ; Lin, Shu-Chuan ; Huang, Rong-Yuan ; Jiang, Ting-Ying ; Chen, Chien-Jung ; Wu, Chia-Mao ; Tang, Chuan-Yi ; Chang, Margaret Dah-Tsyr ; Chou, Wei-I ; Chang, Hao-Teng

  • Author_Institution
    Dept. of Comput. Sci., NTHU, Hsinchu, Taiwan
  • fYear
    2009
  • fDate
    1-4 Nov. 2009
  • Firstpage
    20
  • Lastpage
    25
  • Abstract
    Multiple sequence alignment is widely applied to discover core conserved regions among query sequences. However, the major deficiency is that alignment accuracy is extremely sensitive to primary sequence identity, which causes alignment of low identity sequences difficult. We propose a feature-integrated model called feature-incorporated alignment (FIA) which integrates relevant biological characteristics including aromatic amino acids, hydrophilicity, beta-stranded structure, and BLOSUM62 matrix to locate ligand-binding residue in carbohydrate binding modules (CBMs), a protein family with fairly low sequence identify but highly functional correlation. The results indicated that FIA can not only detect aromatic residues on the outer surface of structure, but also achieve better accuracy than ClustalW2 and DIALIGN-TX on entropy criterion in all three test datasets from CBMs. Computational analysis in CBMs can facilitate the discovery of crucial ligand-binding residues of carbohydrate-active enzymes.
  • Keywords
    bioinformatics; enzymes; molecular biophysics; BLOSUM62 matrix; ClustalW2; DIALIGN-TX; aromatic amino acids; beta-stranded structure; biological feature incorporated alignment; carbohydrate binding modules; computational analysis; core conserved regions; cross species analysis; enzymes; hydrophilicity; ligand-binding residue; multiple sequence alignment; primary sequence identity; protein; query sequences; Amino acids; Biochemistry; Bioinformatics; Biological system modeling; Biology; Cells (biology); Computer science; Proteins; Sequences; Topology; aromatic amino acid; carbohydrate binding modules; hydrophilicity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-0-7695-3885-3
  • Type

    conf

  • DOI
    10.1109/BIBM.2009.53
  • Filename
    5341879