Title :
A Fast Exact Repeats Search Algorithm for Genome Analysis
Author :
Sun, Weidong ; Ma, Zongmin
Author_Institution :
Sch. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Abstract :
The repeat structure of genomic DNA is considered an essential mechanism for evolution and other fundamental biological functions. Repeat finding is regarded as one of the prerequisites for genome sequencing and analysis, and exact repeat finding is the first step for most other repeat-finding problems. In this paper, the authors propose a new fast algorithm, based on simple count sort and radix sort, that specifically targets the exact repeat finding problem. The authors report that it performs more efficiently than any existing exact repeat finding algorithm, owing to the simplicity of both its data structure and the algorithm itself. The algorithm can also be adapted to similar problems in proteome sequence analysis with little modification.
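The abstract does not spell out the algorithm, so the following is only a minimal sketch of how count sort and radix sort could be combined to find exact repeats of a fixed length k. It is not the authors' method. The function names, the fixed-length restriction, and the default `ACGT` alphabet are all assumptions made for illustration. The idea is to sort every k-mer start position with a least-significant-digit radix sort, using one stable counting-sort pass per character column, and then scan for runs of identical k-mers.

```python
def counting_sort_by_column(seq, starts, col, alphabet="ACGT"):
    """Stable counting sort of k-mer start positions by the character at offset `col`.

    Hypothetical helper: characters outside `alphabet` raise KeyError.
    """
    rank = {ch: i for i, ch in enumerate(alphabet)}
    counts = [0] * (len(alphabet) + 1)
    # Count how many k-mers have each character in this column.
    for s in starts:
        counts[rank[seq[s + col]] + 1] += 1
    # Prefix sums turn counts into the first output slot of each bucket.
    for i in range(len(alphabet)):
        counts[i + 1] += counts[i]
    out = [0] * len(starts)
    # Place positions in input order so that the pass is stable.
    for s in starts:
        r = rank[seq[s + col]]
        out[counts[r]] = s
        counts[r] += 1
    return out


def find_exact_repeats(seq, k, alphabet="ACGT"):
    """Return {k-mer: [start positions]} for every k-mer occurring more than once."""
    if k <= 0 or k > len(seq):
        return {}
    starts = list(range(len(seq) - k + 1))
    # LSD radix sort: last column first, each pass stable.
    for col in range(k - 1, -1, -1):
        starts = counting_sort_by_column(seq, starts, col, alphabet)
    repeats = {}
    i = 0
    # After sorting, identical k-mers are adjacent; collect each run.
    while i < len(starts):
        kmer = seq[starts[i]:starts[i] + k]
        j = i + 1
        while j < len(starts) and seq[starts[j]:starts[j] + k] == kmer:
            j += 1
        if j - i > 1:
            repeats[kmer] = sorted(starts[i:j])
        i = j
    return repeats
```

For a sequence of length n this sketch makes k counting-sort passes over roughly n positions each, so sorting costs on the order of n·k operations with a small constant alphabet. Swapping in a 20-letter amino-acid alphabet would be the only change needed to apply the same sketch to protein sequences.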
Keywords :
DNA; biology computing; data structures; genomics; search problems; biological function; data structure; exact repeat finding; fast exact repeats search algorithm; genome analysis; genome sequencing; genomic DNA; radix sort; repeat structure; repeats finding problem; simple count sort; Algorithm design and analysis; Bioinformatics; DNA; Data structures; Genomics; Iterative algorithms; Libraries; Pattern matching; Regulators; Sequences; Radix Sort; Regulator Detection; Repeats Finding;
Conference_Title :
Hybrid Intelligent Systems, 2009. HIS '09. Ninth International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-0-7695-3745-0
DOI :
10.1109/HIS.2009.88