Title :
Genomic signal processing
Author :
Cristea, Paul Dan
Author_Institution :
Biomed. Eng. Center, "Politehnica" Univ. of Bucharest, Romania
Abstract :
Summary form only given. The sequencing of several genomes offers the opportunity to data mine and to explore in depth this unique data repository. Converting the genomic sequences into digital genomic signals offers the possibility to use signal processing methods for handling and analyzing genomic information. Using the genomic signal approach, long range features, maintained over distances of 106-108 of base pairs have been found. In the context of analyzing large volumes of data and of presenting the results in a easy to read form, the problem of data representability becomes critical. In this paper, a novel mathematical description of data graphical representability, based on the data scattering ratio for a pixel, is defined and is applied for several typical cases of standard signals and for genomic signals. It is shown that the variation of genomic data along nucleotide sequences, specifically the cumulated and unwrapped phase, can be visualized adequately as simple graphic lines for low and large scales, while for medium scales (thousands to tens of thousands of base pairs) the statistical descriptions have to be used.
Keywords :
DNA; data structures; data visualisation; genetics; medical signal processing; base pair long range features; cumulated phase; data graphic visualization; data graphical representability; data mining; data representation; digital genomic signals; genomic sequences; genomic signal processing; nucleotide sequences; pixel data scattering ratio; statistical descriptions; unwrapped phase; Bioinformatics; Data mining; Data visualization; Digital signal processing; Genomics; Graphics; Information analysis; Scattering; Signal analysis; Signal processing;
Conference_Titel :
Neural Network Applications in Electrical Engineering, 2004. NEUREL 2004. 2004 7th Seminar on
Print_ISBN :
0-7803-8547-0
DOI :
10.1109/NEUREL.2004.1416514