DocumentCode :
1784823
Title :
Selection of preprocessing methodology for multivariate regression of cellular FTIR and Raman spectra in radiobiological analyses
Author :
Meade, Aidan D. ; Clarke, Christopher ; Byrne, Hugh J. ; Lyng, Fiona M.
Author_Institution :
Sch. of Phys., Dublin Inst. of Technol., Dublin, Ireland
fYear :
2014
fDate :
2-5 Nov. 2014
Firstpage :
254
Lastpage :
260
Abstract :
Vibrational spectra of biological species suffer from the influence of many extraneous interfering factors that require removal through preprocessing before analysis. The present study was conducted to optimise the preprocessing methodology and variable subset selection during regression of and confocal Raman microspectroscopy (CRM) and Fourier Transform Infrared microspectroscopy (FTIRM) spectra against ionizing radiation dose. Skin cells were γ-irradiated in-vitro and their Raman and FTIRM spectra were used to retrospectively predict the radiation dose using linear and nonlinear partial least squares (PLS) regression algorithms in addition to support vector regression (SVR). The optimal preprocessing methodology (which comprised combinations of spectral filtering, baseline subtraction, scaling and normalization options) was selected using a genetic algorithm (GA) with the root mean squared error of prediction (RMSEP) used as the fitness criterion for selection of the preprocessing chromosome (where this was calculated on an independent set of test spectra randomly selected from the dataset on each pass of the algorithm). The results indicated that GA selection of the optimal preprocessing methodology substantially improved the predictive capacity of the regression algorithms over baseline methodologies, although the optimal preprocessing chromosomes were similar for various regression algorithms, suggesting an optimal preprocessing methodology for radiobiological analyses with biospectroscopy. Feature selection of both FTIRM and CRM spectra using genetic algorithms and multivariate regression provided further decreases in RMSEP, but only with non-linear multivariate regression algorithms.
Keywords :
Fourier transform infrared spectra; Raman spectra; biological effects of gamma-rays; biomedical measurement; cellular effects of radiation; feature selection; genetic algorithms; least squares approximations; mean square error methods; medical signal processing; regression analysis; skin; support vector machines; vibrational states; γ-irradiation; CRM; FTIRM spectra; Fourier Transform Infrared microspectroscopy spectra; GA selection; PLS; RMSEP; Raman spectra; SVR; baseline methodologies; baseline subtraction; biological species; biospectroscopy; cellular FTIR; confocal Raman microspectroscopy; extraneous interfering factors; feature selection; fitness criterion; genetic algorithm; ionizing radiation dose; nonlinear multivariate regression algorithms; nonlinear partial least squares regression algorithms; normalization options; optimal preprocessing chromosomes; optimal preprocessing methodology; preprocessing chromosome selection; preprocessing methodology selection; radiobiological analyses; root mean squared error of prediction; scaling options; skin cells; spectral filtering; support vector regression; test spectra; variable subset selection; vibrational spectra; Algorithm design and analysis; Biological cells; Customer relationship management; Filtering; Genetic algorithms; Prediction algorithms; Radiation effects; genetic algorithm; multivariate regression; preprocessing; radiobiology; vibrational spectra;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location :
Belfast
Type :
conf
DOI :
10.1109/BIBM.2014.6999164
Filename :
6999164
Link To Document :
بازگشت