• DocumentCode
    2820815
  • Title

    A modified cross entropy method for detecting multiple change points in DNA Count Data

  • Author

    Priyadarshana, M. ; Sofronov, Georgy

  • Author_Institution
    Dept. of Stat., Macquarie Univ., Sydney, NSW, Australia
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    We model DNA count data as a multiple change point problem, in which the data are divided in to different segments by an unknown number of change points. Each segment is supposed to be generated by unique distribution characteristics inherent to the underlying process. In this paper, we propose a modified version of the Cross-Entropy (CE) method, which utilizes Beta distribution to simulate locations of change points. Several stopping criterions are also discussed. The proposed CE method applies on over-dispersed count data, in which the observations are distributed as independent Negative Binomial. Furthermore, we incorporate the Bayesian Information Criterion to identify the optimal number of change points within the CE method while not fixing the maximum number of change points in the data sequence. We obtain estimates for the artificial data by using the modified CE method and compare the results with the general CE method, which utilizes normal distribution to simulate locations of the change points. The methods are applied to a real DNA count data set in order to illustrate the usefulness of the proposed modified CE method.
  • Keywords
    Bayes methods; DNA; binomial distribution; entropy; medical computing; normal distribution; Bayesian information criterion; CE method; DNA count data; artificial data estimation; beta distribution; change point location simulation; cross entropy method; data distribution characteristics; data sequence; multiple change point detection; negative binomial distribution; normal distribution; optimal change point number identification; stopping criterions; DNA; Data models; Distributed databases; Gaussian distribution; Shape; Standards; Vectors; Cross-Entropy method; DNA count data; change point problem; combinatorial optimization; stochastic optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2012 IEEE Congress on
  • Conference_Location
    Brisbane, QLD
  • Print_ISBN
    978-1-4673-1510-4
  • Electronic_ISBN
    978-1-4673-1508-1
  • Type

    conf

  • DOI
    10.1109/CEC.2012.6256470
  • Filename
    6256470