Title :
Interval Trees for Detection of Overlapping Genetic Entities
Author :
Mohammad, Fahim ; Flight, Robert M. ; Harrison, Benjamin J. ; Petruska, Jeffrey C. ; Rouchka, Eric C.
Author_Institution :
Dept. of Comput. Eng. & Comput. Sci., Univ. of Louisville, Louisville, KY, USA
Abstract :
A variety of systems exist in which annotations are available at various levels of granularity to a reference coordinate system, such as roads and landmarks on a map, features within a 2-dimensional or 3-dimensional image, or genetic entities (GEs) mapped to a reference genome. As the number of annotations grows, methods to efficiently locate overlapping entities within a specific interval of interest are needed. In this paper, the efficiency of using interval trees for storing, maintaining, and querying large numbers of intervals with special attention to genetic entities is demonstrated. The results suggest a significant speed -- up when compared to relational database approaches. As such, interval trees serve as a suitable alternative for storing and searching annotations to a reference coordinate system.
Keywords :
biological techniques; biology computing; genetics; relational databases; 2-dimensional imaging; 3-dimensional imaging; genetic entities; interval trees; overlapping genetic entity detection; reference coordinate system; relational database approaches; Bioinformatics; Browsers; Databases; Genomics; Humans; Probes; gene ID conversion; genetic annotations; identifier mapping; interval overlap; interval trees;
Conference_Titel :
Bioinformatics and Bioengineering (BIBE), 2011 IEEE 11th International Conference on
Conference_Location :
Taichung
Print_ISBN :
978-1-61284-975-1
DOI :
10.1109/BIBE.2011.49