• DocumentCode
    2634950
  • Title

    An Efficient Text Classification Algorithm in E-commerce Application

  • Author

    Da-Sheng, Wu ; Qin-fen, Yu ; Li-juan, Liu

  • Author_Institution
    Sch. of Inf. Eng., Zhejiang Forestry Coll., Lin'an, China
  • Volume
    4
  • fYear
    2009
  • fDate
    March 31 2009-April 2 2009
  • Firstpage
    458
  • Lastpage
    461
  • Abstract
    This paper presents an efficient text classification algorithm for repeated text information on e-commerce sites; it automatically classifies and sorts similar strings. The algorithm greatly increases the efficiency and accuracy of information auditing. All tests show that the algorithm is very efficient when the number of information items is between 100 and 1000, and that a comparison of 1000 text items (strings) can be kept within two seconds. When the number of items exceeds 1000, the computation time increases significantly. Precision can be tuned by adjusting the relevant parameters of the algorithm, such as the number of identical substrings in the comparison results and the length of the split strings. For very short items, such as those of fewer than 10 words, the algorithm can be combined with the Levenshtein algorithm to improve text-search flexibility.
  • Keywords
    Web sites; classification; electronic commerce; sorting; string matching; text analysis; Levenshtein algorithm; Web site; e-commerce; similar string sorting; text classification algorithm; text search; Application software; Automatic control; Business; Classification algorithms; Computer science; Costs; Educational institutions; Forestry; Testing; Text categorization; Text similarity
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Science and Information Engineering, 2009 WRI World Congress on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3507-4
  • Type

    conf

  • DOI
    10.1109/CSIE.2009.346
  • Filename
    5171038
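
The abstract names two tunable parameters (the length of the split strings and the number of identical substrings) and a Levenshtein fallback for items shorter than 10 words. The record does not give the paper's actual procedure, so the following Python is only a minimal sketch of one way such a scheme could look. All function names, the default values `chunk_len=4`, `min_shared=3`, `max_edit_ratio=0.3`, and the greedy grouping strategy are assumptions made for illustration, not the authors' method.

```python
def split_chunks(text, size):
    """Return the set of all contiguous substrings of length `size`."""
    return {text[i:i + size] for i in range(len(text) - size + 1)}


def levenshtein(a, b):
    """Edit distance via the standard two-row dynamic programme."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def is_similar(a, b, chunk_len=4, min_shared=3,
               short_words=10, max_edit_ratio=0.3):
    """Decide whether two strings count as near-duplicates.

    Assumed rule: items with fewer than `short_words` words are compared
    by normalised Levenshtein distance; longer items are compared by the
    number of identical substrings of length `chunk_len`.
    """
    if min(len(a.split()), len(b.split())) < short_words:
        longest = max(len(a), len(b))
        if longest == 0:
            return True  # two empty strings are trivially identical
        return levenshtein(a, b) / longest <= max_edit_ratio
    shared = split_chunks(a, chunk_len) & split_chunks(b, chunk_len)
    return len(shared) >= min_shared


def group_similar(texts, **params):
    """Greedily place each text in the first group whose head it matches."""
    groups = []
    for text in texts:
        for group in groups:
            if is_similar(group[0], text, **params):
                group.append(text)
                break
        else:
            groups.append([text])
    return groups
```

Because every new item is compared against the head of each existing group, the number of comparisons in this sketch grows roughly quadratically in the worst case, which is at least consistent with the abstract's remark that computation time rises sharply beyond 1000 items. Raising `min_shared` or `chunk_len` makes matching stricter; lowering them makes it more permissive.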