DocumentCode
1809418
Title
Poster: Sequence classification of homopolymer emissions (SCOPE)
Author
Morton, James T. ; Abrudan, Patricia ; Liang, Chun ; Karro, John E.
Author_Institution
Depts. of Comp. Sci., Miami Univ., Oxford, OH, USA
fYear
2012
fDate
23-25 Feb. 2012
Firstpage
1
Lastpage
1
Abstract
In this poster we will describe SCOPE: a tool for the identification and removal of homopolymer sequences embedded in sequenced mRNA transcript fragments as a result of the Eukaryotic polyadenylation process, allowing for the identification and cleaning of next generation sequence transcriptome data of fragments added by post-transcriptional processes. By making use of Hidden Markov models trained on-the-fly we are able to detect the embedded homopolymers and accurately identify boundaries in the presence of high rates of sequencing error that might otherwise obscure them. Compared to the SeqClean tool [1], we see comparable to considerably improved sensitivity and specificity in identification, and a significantly improved ability to correctly detect homopolymer boundaries. SCOPE is being developed as an open-source tool, with a preliminary version that will be distributed to interested users upon request to the authors.
Keywords
RNA; biological techniques; biology computing; hidden Markov models; molecular biophysics; molecular configurations; polymers; embedded homopolymer; eukaryotic polyadenylation process; hidden Markov model; homopolymer boundary; homopolymer emission; homopolymer sequence; next generation sequence transcriptome data; open-source tool; post-transcriptional process; sequence classification; sequenced mRNA transcript fragment; Cleaning; Educational institutions; Filtering; Hidden Markov models; Preforms; Sensitivity; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Advances in Bio and Medical Sciences (ICCABS), 2012 IEEE 2nd International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4673-1320-9
Electronic_ISBN
978-1-4673-1319-3
Type
conf
DOI
10.1109/ICCABS.2012.6182655
Filename
6182655
Link To Document