DocumentCode :
2455831
Title :
Aggregating Multiple Biological Measurements Per Patient
Author :
Zubek, Valentina Bayer ; Khan, Faisal M.
fYear :
2010
fDate :
12-14 Dec. 2010
Firstpage :
788
Lastpage :
792
Abstract :
Many machine learning algorithms require a single value per feature per record for modeling. However, there are applications, in the medical domain particularly, where a single record may have multiple observations for the same feature. For example, a patient could have the same gene analyzed in multiple tissue slides of a biopsy, or could have the same genetic test performed on multiple subsequent biopsies. The challenge in these applications is how to integrate multiple observations of the same predictor feature per record. In this paper, two data aggregation methods are compared, one method is a simple median aggregation of feature values, while the other is a novel method which constructs intervals of values for each feature. The aggregated features are passed as input to a novel support vector regression method for modeling survival data in a prostate cancer setting. The performance of both methods was similar in predicting prostate cancer progression on three data cohorts.
Keywords :
cancer; data handling; medical computing; regression analysis; support vector machines; biopsy; data aggregation methods; machine learning algorithms; median aggregation; prostate cancer setting; support vector regression method; Analytical models; Biopsy; Data models; Indexes; Kernel; Prostate cancer; Support vector machines; aggregating multiple measurements per feature; interval data aggregation; prostate cancer; support vector regression for censored data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
Type :
conf
DOI :
10.1109/ICMLA.2010.120
Filename :
5708943
Link To Document :
بازگشت