• DocumentCode
    434547
  • Title
    Designing a value based niche search engine using evolutionary strategies
  • Author
    Sengupta, Sourav ; Jansen, Bernard J.

  • Author_Institution
    Ind. Eng., The Pennsylvania State Univ., PA, USA
  • Volume
    1
  • fYear
    2005
  • fDate
    4-6 April 2005
  • Firstpage
    800
  • Abstract
    The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Although retrieving relevant documents from such collections is difficult, defining categories that broadly classify the information they contain is comparatively easy. Such categories lend value to the information in the collection. This research uses a value-based approach to improve the retrieval accuracy of search engines that retrieve documents from organizational repositories. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 to nearly 90% in generation 10,000.
  • Keywords
    document handling; electronic commerce; evolutionary computation; information retrieval; search engines; corporate intranets; document collection; e-commerce; evolutionary algorithm; evolutionary strategy; organizational repository; retrieval accuracy; search algorithm; unstructured document collections; value based niche search engine; Business; Character generation; Clustering algorithms; Evolutionary computation; Genetics; Industrial engineering; Information retrieval; Search engines; Taxonomy; Testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Technology: Coding and Computing, 2005. ITCC 2005. International Conference on
  • Print_ISBN
    0-7695-2315-3
  • Type
    conf
  • DOI
    10.1109/ITCC.2005.125
  • Filename
    1428562