• DocumentCode
    599171
  • Title
    Phylogenetic analysis of some leguminous trees using CLUSTALW2 bioinformatics tool
  • Author
    Patel, Surabhi ; Panchal, H. ; Anjaria, K.
  • Author_Institution
    Dept. of Comput. Sci. & Technol., Sardar Patel Univ., Anand, India
  • fYear
    2012
  • fDate
    4-7 Oct. 2012
  • Firstpage
    917
  • Lastpage
    921
  • Abstract
    A multiple sequence alignment (MSA) is a sequence alignment of three or more biological sequences, generally proteins. MSA has a wide range of applications, including phylogenetic analysis, protein pattern identification, protein domain identification, prediction of protein structure, assessment of the structural similarity of amino acids, and inference of evolutionary similarity. ClustalW2 is a general-purpose global multiple sequence alignment program for proteins. It produces biologically meaningful multiple sequence alignments of divergent sequences. The output from ClustalW2 shows the best match for the selected sequences and lines them up in such a way that the identities, similarities and differences can be easily seen. Evolutionary relationships can be viewed by creating cladograms or phylograms. The alignment procedure incorporates four refinements. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than in regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening of new gaps at these positions. In this paper, protein sequences of a few legume species were taken from the UNIPROT database, and MSA was performed on the protein sequences of these tree species of the family Leguminosae, using the ClustalW2 tool to generate biological data. The results are discussed with the help of cladograms and phylograms for the selected tree species.
  • Keywords
    bioinformatics; database management systems; genetics; proteins; vegetation; CLUSTALW2 bioinformatics tool; Cladograms; MSA; Phylogram; UNIPROT database; amino acid structural similarity; amino acid substitution matrices; biological sequences; down-weight near-duplicate sequences; hydrophilic regions; leguminous trees; locally reduced gap penalty; multiple sequence alignment; phylogenetic analysis; potential loop regions; protein domain identification; protein pattern identification; protein structure prediction; residue-specific gap penalty; Bioinformatics; Educational institutions; Phylogeny; Protein engineering; Proteins; Vegetation; MSA- Multiple sequence alignment;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-1-4673-2746-6
  • Electronic_ISBN
    978-1-4673-2744-2
  • Type
    conf
  • DOI
    10.1109/BIBMW.2012.6470264
  • Filename
    6470264
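
The third point in the abstract (locally reduced gap penalties in hydrophilic regions) can be illustrated with a minimal sketch. This is not ClustalW2 code. The function name `gap_open_penalties`, the residue set, the minimum run length of 5, and the reduction factor are all assumptions chosen for illustration, and the real program combines this rule with several other position-specific adjustments.

```python
# Illustrative sketch only: per-position gap-opening penalties that are
# lowered inside hydrophilic stretches, where loops are more likely.
# The residue set, run length, and factor below are assumed values.
HYDROPHILIC = set("DEGKNQPRS")


def gap_open_penalties(seq, base_penalty=10.0, min_run=5, factor=2 / 3):
    """Return one gap-opening penalty per residue of `seq`.

    Positions inside a run of at least `min_run` consecutive hydrophilic
    residues get `base_penalty * factor`; all others keep `base_penalty`.
    """
    penalties = [base_penalty] * len(seq)
    run_start = None
    # A sentinel character closes a hydrophilic run that reaches the end.
    for i, residue in enumerate(seq + "#"):
        if residue in HYDROPHILIC:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_run:
                for j in range(run_start, i):
                    penalties[j] = base_penalty * factor
            run_start = None
    return penalties


if __name__ == "__main__":
    # Hypothetical toy sequence: a hydrophobic core flanking a polar stretch.
    for residue, penalty in zip("AAADEGKNAAA", gap_open_penalties("AAADEGKNAAA")):
        print(residue, round(penalty, 2))
```

With a lower penalty inside the polar stretch, an aligner that consults these values would prefer to place new gaps there rather than in the flanking hydrophobic residues, which is the behaviour the abstract describes.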