DocumentCode :
3589495
Title :
Research on energy-efficient text classification
Author :
Hao Lin
Author_Institution :
Sch. of Software, Shanghai Jiao Tong Univ., Shanghai, China
fYear :
2014
Firstpage :
257
Lastpage :
261
Abstract :
People rely on data mining techniques like text categorization more and more to explore valuable information, due to the ever-increasing electronic documents produced. Although the energy consumed by text categorization increases with the data, people usually pay attention to its effectiveness and there is little research about its energy cost. In this paper, we evaluate the energy cost of different classifiers and reduce energy cost by parallelization, trying to find a classifier that performs best on both aspects - effectiveness and efficiency. Several classifiers are obtained by using existing libraries or implementing classification algorithms. Comprehensive experiments on three real datasets show that an improved version of Naive Bayes can have competitive precision compared to SVM while has low energy costs. Parallelization can further reduce its energy cost by a factor of 10 for RCV1 dataset.
Keywords :
energy conservation; pattern classification; text analysis; Naive Bayes; RCV1 dataset; SVM; data mining techniques; energy cost; energy-efficient text classification; Computers; Energy measurement; Estimation; Libraries; Support vector machines; Text categorization; Training; Classification algorithms; Green computing; Parallel processing; Performance evaluation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology and Electronic Commerce (ICITEC), 2014 2nd International Conference on
Print_ISBN :
978-1-4799-5298-4
Type :
conf
DOI :
10.1109/ICITEC.2014.7105614
Filename :
7105614
Link To Document :
بازگشت