Title :
Context-dependent multi-class classification with unknown observation and class distributions with applications to bioinformatics
Author :
Baras, Alexander S. ; Baras, John S.
Author_Institution :
Dept. of Pathology, Univ. of Virginia Health Syst., Charlottesville, VA, USA
Abstract :
We consider the multi-class classification problem, based on vector observation sequences, where the conditional (given class observations) probability distributions for each class as well as the unconditional probability distribution of the observations are unknown. We develop a novel formulation that combines training with the quality of classification that can be obtained using the ´learned´ (via training) models. The parametric models we use are finite mixture models, where the same component densities are used in the model for each class, albeit with different mixture weights. Thus we use a model known as all-class-one-network (ACON) model in the neural network literature. We argue why this is a more appropriate model for context-dependent classification, as is common in bioinformatics. We derive rigorously the solution to this joint optimization problem. A key step in our approach is to consider a tight (provably) bound between the average Bayes error (the true minimal classification error) and the average model-based classification error. We rigorously show that the parameter estimates maximize the likelihood of the model-based class posterior probability distributions. We illustrate by application examples in the bioinformatics of cancer.
Keywords :
Bayes methods; bioinformatics; cancer; neural nets; optimisation; pattern classification; probability; statistical distributions; all class one network model; average Bayes error; average model based classification error; cancer bioinformatics; class distributions; context dependent multi-class classification; joint optimization problem; model based class posterior probability distributions; neural network; unknown observation; Bioinformatics; Biological system modeling; Context modeling; Maximum likelihood detection; Maximum likelihood estimation; Neural networks; Parameter estimation; Parametric statistics; Pattern recognition; Probability distribution;
Conference_Titel :
Decision and Control, 2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009. Proceedings of the 48th IEEE Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3871-6
Electronic_ISBN :
0191-2216
DOI :
10.1109/CDC.2009.5400269