  • DocumentCode
    1809418
  • Title
    Poster: Sequence classification of homopolymer emissions (SCOPE)
  • Author
    Morton, James T. ; Abrudan, Patricia ; Liang, Chun ; Karro, John E.
  • Author_Institution
    Depts. of Comp. Sci., Miami Univ., Oxford, OH, USA
  • fYear
    2012
  • fDate
    23-25 Feb. 2012
  • Firstpage
    1
  • Lastpage
    1
  • Abstract
    This poster describes SCOPE, a tool for identifying and removing homopolymer sequences embedded in sequenced mRNA transcript fragments as a result of the eukaryotic polyadenylation process. This allows next-generation sequencing transcriptome data to be cleaned of fragments added by post-transcriptional processes. Using hidden Markov models trained on the fly, SCOPE detects the embedded homopolymers and accurately identifies their boundaries even in the presence of high sequencing error rates that might otherwise obscure them. Compared to the SeqClean tool [1], SCOPE shows comparable to considerably improved sensitivity and specificity in identification, and a significantly improved ability to correctly detect homopolymer boundaries. SCOPE is being developed as an open-source tool; a preliminary version will be distributed to interested users upon request to the authors.
  • Keywords
    RNA; biological techniques; biology computing; hidden Markov models; molecular biophysics; molecular configurations; polymers; embedded homopolymer; eukaryotic polyadenylation process; hidden Markov model; homopolymer boundary; homopolymer emission; homopolymer sequence; next generation sequence transcriptome data; open-source tool; post-transcriptional process; sequence classification; sequenced mRNA transcript fragment; Cleaning; Educational institutions; Filtering; Hidden Markov models; Preforms; Sensitivity; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Computational Advances in Bio and Medical Sciences (ICCABS), 2012 IEEE 2nd International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4673-1320-9
  • Electronic_ISBN
    978-1-4673-1319-3
  • Type
    conf
  • DOI
    10.1109/ICCABS.2012.6182655
  • Filename
    6182655
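  • Illustrative sketch
    The abstract describes detecting embedded homopolymers and their boundaries with hidden Markov models, and the keywords mention the Viterbi algorithm. The following is a minimal sketch of that general idea, not SCOPE's implementation: a two-state HMM (background vs. homopolymer) decoded with Viterbi in log space. The function names, the fixed emission and transition probabilities, and the restriction to uppercase A/C/G/T input are all assumptions made for illustration; SCOPE trains its models on the fly, which this sketch does not attempt.

```python
import math


def make_model(base="A", p_base=0.91, p_stay=0.99):
    """Build a hypothetical two-state HMM: state 0 = background, state 1 = homopolymer.

    All probabilities are illustrative assumptions, not values from SCOPE.
    """
    other = (1.0 - p_base) / 3.0  # error mass shared by the three other bases
    emit = [
        {b: math.log(0.25) for b in "ACGT"},                       # background: uniform
        {b: math.log(p_base if b == base else other) for b in "ACGT"},  # homopolymer
    ]
    trans = [
        [math.log(p_stay), math.log(1.0 - p_stay)],
        [math.log(1.0 - p_stay), math.log(p_stay)],
    ]
    start = [math.log(0.5), math.log(0.5)]
    return start, trans, emit


def viterbi(seq, model):
    """Return the most likely state path (list of 0/1) for seq."""
    start, trans, emit = model
    if not seq:
        return []
    score = [start[s] + emit[s][seq[0]] for s in (0, 1)]
    back = []  # back[t][s] = best predecessor of state s at position t + 1
    for ch in seq[1:]:
        ptr, new = [], []
        for s in (0, 1):
            prev = max((0, 1), key=lambda p: score[p] + trans[p][s])
            ptr.append(prev)
            new.append(score[prev] + trans[prev][s] + emit[s][ch])
        back.append(ptr)
        score = new
    state = max((0, 1), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


def find_homopolymer(seq, base="A"):
    """Return (start, end) of the longest homopolymer-state run, or None.

    The interval is half-open: seq[start:end] is the detected region.
    """
    path = viterbi(seq, make_model(base))
    best = None
    i = 0
    while i < len(path):
        if path[i] == 1:
            j = i
            while j < len(path) and path[j] == 1:
                j += 1
            if best is None or j - i > best[1] - best[0]:
                best = (i, j)
            i = j
        else:
            i += 1
    return best


if __name__ == "__main__":
    transcript = "GCTAGCTTGACCGTACGGTC"
    tail = "AAAAAAAAGAAAAAAAAAAAA"  # poly(A) tail with one simulated sequencing error
    print(find_homopolymer(transcript + tail))
```

    Because the homopolymer state tolerates an occasional mismatched base at a small penalty, a single sequencing error inside the tail does not split the detected region, whereas isolated A's in the transcript body are too short to justify switching states. Trimming would then amount to keeping seq[:start] when a region is reported.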