• DocumentCode
    1220107
  • Title

    Competitor Mining with the Web

  • Author

    Bao, Shenghua ; Li, Rui ; Yu, Yong ; Cao, Yunbo

  • Author_Institution
    Shanghai Jiao Tong Univ., Shanghai
  • Volume
    20
  • Issue
    10
  • fYear
    2008
  • Firstpage
    1297
  • Lastpage
    1310
  • Abstract
    This paper is concerned with the problem of mining competitors from the Web automatically. Nowadays the fierce competition in the market necessitates every company not only to know which companies are its primary competitors, but also in which fields the company´s rivals compete with itself and what its competitors´ strength is in a specific competitive domain. The task of competitor mining that we address in the paper includes mining all the information such as competitors, competing fields and competitors´ strength. A novel algorithm called CoMiner is proposed, which tries to conduct a Web-scale mining in a domain-independent manner. The CoMiner algorithm consists of three parts: 1) given an input entity, extracting a set of comparative candidates and then ranking them according to comparability; 2) extracting the fields in which the given entity and its competitors play against each other; 3) identifying and summarizing the competitive evidence that details the competitors´ strength. As for evaluation, a prototype system implementing the CoMiner algorithm is presented. An evaluation data set consisting of 70 entities is constructed. 728 competitors and 3,640 competitive fields with 6,381 competitive evidences are discovered with the prototype. The experimental results show that the proposed algorithm is highly effective.
  • Keywords
    Internet; data mining; CoMiner algorithm; Web-scale mining; competitor mining; information retrieval; information search; Content Analysis and Indexing; Information Search and Retrieval; Performance evaluation;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.98
  • Filename
    4522551