DocumentCode :
1844706
Title :
A Method for Text Categorization Using BP Network Based on Hadoop
Author :
Jia Yusheng ; Zhu Qing
Author_Institution :
Sch. of Software Eng., Beijing Univ. of Technol., Beijing, China
fYear :
2013
fDate :
21-23 June 2013
Firstpage :
818
Lastpage :
821
Abstract :
Based on the analysis of the Hadoop open source distributed computing platform as well as the parallel training methods for the BP network, for the disadvantage of time-consuming when using large amounts of texts to train the BP network, we designed a BP network text categorization model based on data parallel method on Hadoop platform using the MapReduce programming model. The model uses the method of batch training, it adjusts the network weights after getting the accumulated error by summing every sample training error on each node, and the categorization of text is done in parallel. The method based on Hadoop platform improves the training speed of BP network and efficiency of text categorization, and achieves good categorization performance.
Keywords :
backpropagation; neural nets; parallel algorithms; public domain software; text analysis; BP network; Hadoop open source distributed computing platform; MapReduce programming model; batch training; data parallel method; parallel training methods; text categorization; Neural networks; Neurons; Program processors; Support vector machine classification; Text categorization; Training; Vectors; BP Neural Network; Hadoop; Text Categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational and Information Sciences (ICCIS), 2013 Fifth International Conference on
Conference_Location :
Shiyang
Type :
conf
DOI :
10.1109/ICCIS.2013.219
Filename :
6643135
Link To Document :
بازگشت