• DocumentCode
    3516409
  • Title

    pFilter: global information filtering and dissemination using structured overlay networks

  • Author

    Tang, Chunqiang ; Xu, Zhichen

  • Author_Institution
    Dept. of Comput. Sci., Rochester Univ., NY, USA
  • fYear
    2003
  • fDate
    28-30 May 2003
  • Firstpage
    24
  • Lastpage
    30
  • Abstract
    The exponential data growth rate of the Internet makes it increasingly difficult for people to find desired information in a timely fashion. Information filtering and dissemination systems allow users to register persistent queries called user profiles, and notify users when relevant files become available. Existing such systems, however, either are not scalable, or do not support matching of unstructured documents (e.g., text, HTML, image, audio or video files) that account for a significant percentage of Internet contents. We propose pFilter, a global-scale, decentralized information filtering and dissemination system for unstructured documents. To handle potentially billions of documents for millions of subscribers, pFilter connects a large number of computers into a structured peer-to-peer overlay network. Computers in the overlay collectively publish or collect documents, build indices, register profiles, filter and disseminate documents. Profiles and documents are distributed through the network according to their semantics such that they can be matched efficiently and accurately without excessive flooding. pFilter employs scalable application-level multicast to deliver matching documents to a large number of interested parties efficiently.
  • Keywords
    Internet; document delivery; document handling; information dissemination; information filters; global information filtering; peer-to-peer network; pFilter; structured overlay network; user profile; Computer networks; Computer science; Distributed computing; HTML; Information filtering; Peer to peer computing; Registers; Search engines
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Distributed Computing Systems, 2003. FTDCS 2003. Proceedings. The Ninth IEEE Workshop on Future Trends of
  • ISSN
    1071-0485
  • Print_ISBN
    0-7695-1910-5
  • Type

    conf

  • DOI
    10.1109/FTDCS.2003.1204290
  • Filename
    1204290