• DocumentCode
    3378158
  • Title
    Secured distributed document clustering & keyphrase extraction algorithm in structured Peer to Peer networks
  • Author
    Nair, Vijith Vijayakumaran ; Judith, J.E. ; JayaKumari, J.
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Noorul Islam Centre for Higher Educ., Kumaracoil, India
  • fYear
    2011
  • fDate
    21-22 July 2011
  • Firstpage
    151
  • Lastpage
    156
  • Abstract
    A secured Hierarchically Distributed Peer-to-Peer (HDP2PC) architecture and clustering algorithm are used to overcome the scalability problem in structured peer-to-peer networks. The architecture is based on a multilayer overlay network of peer neighbourhoods and can incorporate any number of layers of nodes. Supernodes, which act as representatives of neighbourhoods, are iteratively grouped to form higher-level neighbourhoods. Within a given level of the hierarchy, peers cooperate within their respective neighbourhoods to perform P2P clustering. A novel approach is proposed for indexing the documents into the various nodes arranged in the hierarchy. A hashing mechanism is used to index the documents. A number of filters are applied as parameters, thereby reducing the number of comparisons required to extract keyphrases. A distributed keyphrase extraction algorithm is used to extract patterns by interpreting clusters stored in the neighbouring workstations. Queries can also be applied to loosely structured formats. Speedup is achieved by adjusting the neighbourhood size and height parameters. Privacy is also provided for data inside the peers. No data is shared between the peer nodes. Security can be enforced in the peers while clustering is performed.
  • Keywords
    cryptography; data mining; document handling; file organisation; peer-to-peer computing; HDP2PC architecture; P2P clustering; clustering algorithm; document indexing; hashing mechanism; keyphrase extraction algorithm; loosely structured format; multilayer overlay network; pattern extract; secured distributed document clustering; secured hierarchically distributed peer-to-peer architecture; structured peer to peer networks; supernodes; Approximation algorithms; Clustering algorithms; Computer architecture; Data mining; Data models; Peer to peer computing; Signal processing algorithms; associativity; distributed document clustering; hierarchical P2P network; keyphrases
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing, Communication, Computing and Networking Technologies (ICSCCN), 2011 International Conference on
  • Conference_Location
    Thuckalay
  • Print_ISBN
    978-1-61284-654-5
  • Type
    conf
  • DOI
    10.1109/ICSCCN.2011.6024533
  • Filename
    6024533
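
The multilayer overlay described in the abstract, in which supernodes representing neighbourhoods are iteratively grouped into higher-level neighbourhoods, might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' algorithm: the function name `build_hierarchy`, the fixed `neighbourhood_size` parameter, and the rule of taking the first peer of each neighbourhood as its supernode are all hypothetical choices made for the example.

```python
from typing import List


def build_hierarchy(peers: List[str], neighbourhood_size: int) -> List[List[List[str]]]:
    """Group peers into neighbourhoods, then group their supernodes, level by level.

    Returns a list of levels; each level is a list of neighbourhoods, and each
    neighbourhood is a list of peer identifiers. The number of levels is the
    height of the hierarchy.
    """
    if neighbourhood_size < 2:
        # With size 1 every peer would be its own supernode and the loop
        # below would never shrink the set of nodes.
        raise ValueError("neighbourhood_size must be at least 2")
    if not peers:
        return []

    levels: List[List[List[str]]] = []
    current = list(peers)
    while True:
        # Partition the nodes of this level into consecutive neighbourhoods.
        neighbourhoods = [
            current[i:i + neighbourhood_size]
            for i in range(0, len(current), neighbourhood_size)
        ]
        levels.append(neighbourhoods)
        if len(neighbourhoods) == 1:
            break  # a single top-level neighbourhood: the hierarchy is complete
        # Assumption: the first member of each neighbourhood acts as its supernode.
        current = [members[0] for members in neighbourhoods]
    return levels


if __name__ == "__main__":
    peers = [f"p{i}" for i in range(8)]
    for size in (2, 4):
        hierarchy = build_hierarchy(peers, size)
        print(f"neighbourhood_size={size}: height={len(hierarchy)}")
```

In this sketch a larger neighbourhood size yields a shallower hierarchy for the same set of peers, which is one way to read the abstract's remark that the neighbourhood size and height parameters can be adjusted.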