• DocumentCode
    645623
  • Title

    A distributed semantic similar search for high-dimensional resources in low-dimensional content addressable network

  • Author

    Hu, Qingyuan ; Zhang, Chunhong ; Ji, Yang

  • Author_Institution
    Mobile Life and New Media Laboratory, Beijing University of Posts and Telecommunications, Beijing, China
  • fYear
    2013
  • fDate
    8-11 Sept. 2013
  • Firstpage
    3548
  • Lastpage
    3552
  • Abstract
    A mechanism for distributed semantic similar resource search is proposed in P2P network. The mechanism is based on the content addressable network (CAN). CAN, one of P2P networks, has the natural ability to support the semantic similar search with the semantic vector space model (SVSM) of resources. However, there exists a mismatching problem between the low-dimension CAN network and the high-dimension resources, which needs a dimensionality reduction algorithm. For the semantic similar search in distributed environment of CAN, the applied dimensionality reduction algorithm needs to meet two specific requirements: maintenance for semantic similarity of SVSM of resources, and distributed computing with large and dynamic data, which is not well researched. A distributed algorithm called D-PCA is proposed based on the statistical characteristic of resources in each node. It extracts the principal components of original high-dimensional SVSM to reduce the dimension in a distributed way. D-PCA is taken as a novel hash function to project high-dimensional SVSM into low-dimensional space of distributed hash table in CAN. A semantic indexing and searching process based on semantic DHT in CAN are simulated to show the applicability of D-PCA and the effectiveness of semantic similar search.
  • Keywords
    Correlation; Heuristic algorithms; Peer-to-peer computing; Principal component analysis; Semantics; Testing; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Personal Indoor and Mobile Radio Communications (PIMRC), 2013 IEEE 24th International Symposium on
  • Conference_Location
    London, United Kingdom
  • ISSN
    2166-9570
  • Type

    conf

  • DOI
    10.1109/PIMRC.2013.6666764
  • Filename
    6666764