DocumentCode :
582824
Title :
Gaussian Mixture Model for automatic motif finding in promoter sequences
Author :
Shuo, Guo ; Decheng, Yuan ; Mingzhong, Huang
Author_Institution :
Coll. of Inf. Eng., Shenyang Univ. of Chem. Technol., Shenyang, China
fYear :
2012
fDate :
25-27 July 2012
Firstpage :
7443
Lastpage :
7448
Abstract :
Motif finding is an important part of bioinformatics studies. This paper proposed an algorithm used for automatic motif finding. Gaussian Mixture Model is applied to build a motifs finding model in promoter sequence. The fuzzy cluster is used to determine the optimal numbers of GMM components and apply the initial values for the expectation maximization (EM) algorithm which is used to obtain the parameter estimates. The approach can identify the most important motifs around transcription start site and can also be used for other biological functional sequences motif finding. The simulation results show the proposed method is more effective for different motif finding than finding tools proposed in paper [10] and improves the precision of detection.
Keywords :
Gaussian processes; bioinformatics; expectation-maximisation algorithm; fuzzy set theory; pattern clustering; EM algorithm; GMM components; Gaussian mixture model; automatic motif finding; bioinformatics; biological functional sequences; expectation maximization algorithm; fuzzy cluster; motif finding model; parameter estimation; promoter sequences; transcription start site; Chemical technology; Clustering algorithms; DNA; Educational institutions; Electronic mail; Cluster analysis; Expectation maximization; Gaussian Mixture Model; Motif finding; Transcription start site;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2012 31st Chinese
Conference_Location :
Hefei
ISSN :
1934-1768
Print_ISBN :
978-1-4673-2581-3
Type :
conf
Filename :
6391258
Link To Document :
بازگشت