Title :
Mining Biological Sequences with Masks
Author :
Battaglia, Giovanni ; Grossi, Roberto ; Marangoni, Roberto ; Pisanti, Nadia
Author_Institution :
Dipt. di Inf., Univ. di Pisa, Pisa, Italy
fDate :
Aug. 31 2009-Sept. 4 2009
Abstract :
A new notion of motifs, called masks has been introduced, along with the tool MaskMiner to extract them. Masks can be seen as a succinct representation of the repeated patterns occurring in the given input sequence. In this paper we apply this paradigm to mine the sequences of two glutamate receptors of human and mouse genomes, and thus discover some properties concerning frequent masks.These experiments will also highlight some interesting peculiarities of MaskMiner.
Keywords :
biology computing; data mining; MaskMiner; biological sequence mining; glutamate receptors; human genomes; mouse genomes; Bioinformatics; Biological system modeling; Databases; Expert systems; Genomics; Humans; Inference algorithms; Mice; Pattern matching; Solids; Motif inference; doubling algorithm; partial order set; pattern with don´t care;
Conference_Titel :
Database and Expert Systems Application, 2009. DEXA '09. 20th International Workshop on
Conference_Location :
Linz
Print_ISBN :
978-0-7695-3763-4
DOI :
10.1109/DEXA.2009.47