DocumentCode :
3660339
Title :
Parallelized text classification algorithm for processing large scale TCM clinical data with MapReduce
Author :
Xianju Fei;XiaoFang Li;Chunti Shen
Author_Institution :
Department of Computer &
fYear :
2015
Firstpage :
1983
Lastpage :
1986
Abstract :
There are many opportunities and challenges in data analytic research for TCM (Traditional Chinese Medicine) in advent of big data era, like various clinical record sources, different symptom descriptions, lots of collected clinical symptoms, more than one syndrome attached to one clinical record and etc. Novel methods on support vector machines, ensemble learning, feature selection, multi-label learning in machine learning field are proposed to meet the challenges. When dealing with large scale clinical data of TCM, the accuracy of a multi-class classifier is lower. The training process of SVM is difficult to be parallel processing and has a slower computational speed. To improve the efficiency of TCM data processing, we propose a parallelized text classification algorithm for processing large scale TCM clinical data with MapReduce.
Keywords :
"Classification algorithms","Algorithm design and analysis","Support vector machines","Training","Prediction algorithms","Tongue","Text categorization"
Publisher :
ieee
Conference_Titel :
Information and Automation, 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICInfA.2015.7279613
Filename :
7279613
Link To Document :
بازگشت