DocumentCode :
2634950
Title :
An Efficient Text Classification Algorithm in E-commerce Application
Author :
Da-Sheng, Wu ; Qin-fen, Yu ; Li-juan, Liu
Author_Institution :
Sch. of Inf. Eng., Zhejiang Forestry Coll., Lin'an, China
Volume :
4
fYear :
2009
fDate :
March 31 2009-April 2 2009
Firstpage :
458
Lastpage :
461
Abstract :
This paper presents an efficient text classification algorithm for repeated text information on e-commerce sites; it automatically classifies and sorts similar strings. The algorithm greatly increases the efficiency and accuracy of information auditing. Tests show that the algorithm is very efficient for between 100 and 1000 items of information: comparing 1000 text items (strings) can be kept within two seconds. Beyond 1000 items, the computation time increases significantly. Precision can be tuned by adjusting the relevant parameters of the algorithm, such as the number of identical substrings required in the comparison results and the length of the split strings. For very short texts, such as those of fewer than 10 words, the algorithm can be combined with the Levenshtein algorithm to improve text-search flexibility.
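The abstract does not spell out the algorithm itself, so the following Python is only a minimal sketch of the general idea it names: split each string into substrings of a fixed length, count the substrings two strings share, and fall back to Levenshtein distance for texts under 10 words. Every name and default here (split_substrings, is_similar, group_similar, split_length, min_shared, short_limit, max_edits) is hypothetical, as are the choices to use overlapping windows and to compare each text against the first member of a group; none of this is taken from the paper.

```python
def split_substrings(text, length):
    """Return the set of overlapping substrings of the given length (assumed split scheme)."""
    if len(text) < length:
        return {text} if text else set()
    return {text[i:i + length] for i in range(len(text) - length + 1)}


def levenshtein(a, b):
    """Standard dynamic-programming edit distance between two strings."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def is_similar(a, b, split_length=4, min_shared=3, short_limit=10, max_edits=2):
    """Decide whether two texts count as similar (all thresholds are assumed values)."""
    # Short texts (fewer than short_limit words): use edit distance instead.
    if min(len(a.split()), len(b.split())) < short_limit:
        return levenshtein(a, b) <= max_edits
    # Otherwise count the distinct fixed-length substrings the two texts share.
    shared = split_substrings(a, split_length) & split_substrings(b, split_length)
    return len(shared) >= min_shared


def group_similar(texts, **params):
    """Greedily group texts, comparing each one to the first member of every group."""
    groups = []
    for text in texts:
        for group in groups:
            if is_similar(group[0], text, **params):
                group.append(text)
                break
        else:
            groups.append([text])
    return groups
```

In this sketch, raising min_shared or split_length makes matching stricter, which corresponds to the precision tuning the abstract mentions. The grouping loop compares each text against every existing group, so its cost grows quickly with the number of texts.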
Keywords :
Web sites; classification; electronic commerce; sorting; string matching; text analysis; Levenshtein algorithm; Web site; e-commerce; similar string sorting; text classification algorithm; text search; Application software; Automatic control; Business; Classification algorithms; Computer science; Costs; Educational institutions; Forestry; Testing; Text categorization; Text similarity
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
Type :
conf
DOI :
10.1109/CSIE.2009.346
Filename :
5171038