Title :
A frequency based encoding technique for transformation of categorical variables in mixed IVF dataset
Author :
Uyar, Asli ; Bener, Ayse ; Ciray, H. Nadir ; Bahceci, Mustafa
Author_Institution :
Bogazici Univ., Istanbul, Turkey
Abstract :
Implantation prediction of in-vitro fertilization (IVF) embryos is critical for the success of the treatment. In this study, support vector machine (SVM) method has been used on an original IVF dataset for classification of embryos according to implantation potentials. The dataset we analyzed includes both categorical and continuous feature values. Transformation of categorical variables into numeric attributes is an important pre-processing stage for SVM affecting the performance of the classification. We have proposed a frequency based encoding technique for transformation of categorical variables. Experimental results revealed that, the proposed technique significantly improved the performance of IVF implantation prediction in terms of area under ROC curve (0.712 plusmn 0.032) compared to common binary encoding and expert judgement based transformation methods (0.676 plusmn 0.033 and 0.696 plusmn 0.024, respectively).
Keywords :
encoding; medical computing; support vector machines; area under ROC curve; binary encoding; categorical variables; embryo classification; frequency-based encoding technique; implantation prediction; in vitro fertilization; mixed IVF dataset; support vector machine; transformation methods; Algorithms; Artificial Intelligence; Decision Support Systems, Clinical; Decision Support Techniques; Fertilization in Vitro; Humans; Outcome Assessment (Health Care); Pattern Recognition, Automated; Prognosis;
Conference_Titel :
Engineering in Medicine and Biology Society, 2009. EMBC 2009. Annual International Conference of the IEEE
Conference_Location :
Minneapolis, MN
Print_ISBN :
978-1-4244-3296-7
Electronic_ISBN :
1557-170X
DOI :
10.1109/IEMBS.2009.5334548