DocumentCode
2634950
Title
An Efficient Text Classification Algorithm in E-commerce Application
Author
Da-Sheng, Wu ; Qin-fen, Yu ; Li-juan, Liu
Author_Institution
Sch. of Inf. Eng., Zhejiang Forestry Coll., Lin´´an, China
Volume
4
fYear
2009
fDate
March 31 2009-April 2 2009
Firstpage
458
Lastpage
461
Abstract
In this paper, an efficient text classification algorithm for repeating-text information on the e-commerce site can automatically classify and sort the similar string. This algorithm will greatly increase the efficiency and accuracy of audited information. All tests show that for the number of information between 100 and 1000 the algorithm is very efficient, and the 1000 text information(strings) comparison can be controlled in two seconds. When the amount of information is over 1000, the computation time will be significantly increased. The precision can be rectified to adjust the relevant parameters of the algorithm, such as the number of the same substring in comparison results and the length of split string. For too short information, such as less than 10 words, the algorithm can be combined with the Levenshtein algorithm, in order to improve the text-search flexibility.
Keywords
Web sites; classification; electronic commerce; sorting; string matching; text analysis; Levenshtein algorithm; Web site; e-commerce; similar string sorting; text classification algorithm; text search; Application software; Automatic control; Business; Classification algorithms; Computer science; Costs; Educational institutions; Forestry; Testing; Text categorization; Text similarity; e-commerce; text classification algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3507-4
Type
conf
DOI
10.1109/CSIE.2009.346
Filename
5171038
Link To Document