• DocumentCode
    40781
  • Title

    Reliable Radiation Hybrid Maps: An Efficient Scalable Clustering-Based Approach

  • Author

    Seetan, Raed I. ; Denton, Anne M. ; Al-Azzam, Omar ; Kumar, Ajit ; Iqbal, M. Javed ; Kianian, Shahryar F.

  • Author_Institution
    Dept. of Comput. Sci., North Dakota State Univ., Fargo, ND, USA
  • Volume
    11
  • Issue
    5
  • fYear
    2014
  • fDate
    Sept.-Oct. 1 2014
  • Firstpage
    788
  • Lastpage
    800
  • Abstract
    The process of mapping markers from radiation hybrid mapping (RHM) experiments is equivalent to the traveling salesman problem and, thereby, has combinatorial complexity. As an additional problem, experiments typically result in some unreliable markers that reduce the overall quality of the map. We propose a clustering approach for addressing both problems efficiently by eliminating unreliable markers without the need for mapping the complete set of markers. Traditional approaches for eliminating markers use resampling of the full data set, which has an even higher computational complexity than the original mapping problem. In contrast, the proposed approach uses a divide-and-conquer strategy to construct framework maps based on clusters that exclude unreliable markers. Clusters are ordered using parallel processing and are then combined to form the complete map. We present three algorithms that explore the trade-off between the number of markers included in the map and placement accuracy. Using an RHM data set of the human genome, we compare the framework maps from our proposed approaches with published physical maps and with the results of using the Carthagene tool. Overall, our approaches have a very low computational complexity and produce solid framework maps with good chromosome coverage and high agreement with the physical map marker order.
  • Keywords
    bioinformatics; cellular biophysics; computational complexity; divide and conquer methods; genomics; parallel databases; pattern clustering; Carthagene tool; RHM data set; chromosome coverage; combinatorial complexity; divide-and-conquer strategy; efficient scalable clustering-based approach; full data set resampling; human genome; original mapping problem; parallel processing; reliable radiation hybrid maps; solid framework maps; traveling salesman problem; Bioinformatics; Biological cells; Buildings; Clustering algorithms; Couplings; Sociology; Statistics; Framework mapping; bioinformatics; clustering; radiation hybrid mapping; travelling salesman problem;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2329310
  • Filename
    6827170