• DocumentCode
    1880232
  • Title

    A highly efficient distributed indexing system based on large cluster of commodity machines

  • Author

    Pole, Govind S. ; Potey, Madhuri A.

  • Author_Institution
    Dept. of Comput. Eng., D.Y. Patil Coll. of Eng., Pune, India
  • fYear
    2012
  • fDate
    20-22 Sept. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    An information retrieval system that uses a centralized approach takes a long time to update its web index. A distributed indexing system can operate on large and diverse datasets and update the web index in less time than the centralized approach. This paper presents a prototype of a highly efficient distributed indexing system, deployed on a cluster of commodity machines, that creates a large index using the functionality of Apache Lucene. Experimental results show the efficiency of the distributed indexing process. The distributed approach reduces the time needed to create and update the index, which in turn keeps the index content fresher.
  • Keywords
    Internet; indexing; information retrieval systems; Apache Lucene functionality; Web index update; centralized approach; commodity machine cluster; highly efficient distributed indexing system; index content; information retrieval system; large index creation; Computers; Educational institutions; Indexing; Search engines; Standards; Web pages; commodity computing; dataset; distributed indexers; lucene; parser; retrieval;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Wireless and Optical Communications Networks (WOCN), 2012 Ninth International Conference on
  • Conference_Location
    Indore
  • ISSN
    2151-7681
  • Print_ISBN
    978-1-4673-1988-1
  • Type

    conf

  • DOI
    10.1109/WOCN.2012.6335562
  • Filename
    6335562