Title :
Empirical Normalization for Quadratic Discriminant Analysis and Classifying Cancer Subtypes
Author :
Kon, Mark A. ; Nikolaev, Nikolay
Author_Institution :
Dept. of Math. & Stat., Boston Univ., Boston, MA, USA
Abstract :
We introduce a new discriminant analysis method, Empirical Discriminant Analysis (EDA), for binary classification in machine learning. Given a dataset of feature vectors, the method defines an empirical feature map that transforms the training and test data into new data whose components have Gaussian empirical distributions. This map is an empirical version of the Gaussian copula used in probability and mathematical finance. The purpose is to make the feature-mapped dataset as close to Gaussian as possible, after which standard quadratic discriminants can be used for classification. We discuss the method in general and apply it to some datasets in computational biology.
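As a rough illustration of the component-wise map described in the abstract, the Python sketch below sends each feature through its empirical distribution function and then through the inverse standard normal distribution function, so that the mapped training values are approximately Gaussian. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, the rank / (n + 1) rescaling is one common convention chosen here to keep the scores finite, and fitting the map on the pooled training data is an assumption the record does not confirm.

```python
from bisect import bisect_right
from statistics import NormalDist


def fit_empirical_gaussian_map(train_column):
    """Fit a map for one feature (hypothetical helper, not from the paper).

    The returned function sends a scalar x to Phi^{-1}(F_hat(x)), where
    F_hat is the empirical distribution function of the training values
    and Phi is the standard normal distribution function.
    """
    sorted_vals = sorted(train_column)
    n = len(sorted_vals)
    std_normal = NormalDist()  # mean 0, standard deviation 1

    def transform(x):
        # Number of training values less than or equal to x.
        rank = bisect_right(sorted_vals, x)
        # Assumed convention: divide by n + 1 and floor the rank at 0.5
        # so the probability stays strictly inside (0, 1).
        p = max(rank, 0.5) / (n + 1)
        return std_normal.inv_cdf(p)

    return transform


def fit_feature_map(train_rows):
    """Fit one map per feature column and return a row-wise transform."""
    columns = list(zip(*train_rows))
    maps = [fit_empirical_gaussian_map(col) for col in columns]

    def apply(row):
        return [m(v) for m, v in zip(maps, row)]

    return apply


# Usage: fit on training rows, then apply to training and test rows alike.
train = [[3.0, 10.0], [1.0, 30.0], [2.0, 20.0]]
feature_map = fit_feature_map(train)
mapped_train = [feature_map(row) for row in train]
mapped_test = feature_map([-100.0, 25.0])
```

The mapped rows could then be passed to any standard quadratic discriminant classifier. Because the map is fitted on the training data only, test values outside the training range are clamped to the most extreme finite score.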
Keywords :
Gaussian distribution; biology computing; cancer; learning (artificial intelligence); pattern classification; Gaussian copula; Gaussian empirical distribution; binary classification; cancer subtypes classification; computational biology; empirical discriminant analysis; empirical feature map; empirical normalization; machine learning; quadratic discriminant analysis; Jacobian matrices; Joints; Random variables; Support vector machine classification; Training; Vectors; classification; copula; discriminant
Conference_Title :
Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4577-2134-2
DOI :
10.1109/ICMLA.2011.160