  • DocumentCode
    2827566
  • Title

    A Resource-Efficient Method for Crawling Swarm Information in Multiple BitTorrent Networks

  • Author

    Yoshida, Masahiro ; Nakao, Akihiro

  • Author_Institution
    Univ. of Tokyo, Tokyo, Japan
  • fYear
    2011
  • fDate
    23-27 March 2011
  • Firstpage
    497
  • Lastpage
    502
  • Abstract
    BitTorrent is one of the most popular P2P file sharing applications in the world. Each BitTorrent network is called a swarm, and millions of peers may join multiple swarms. Because of the swarms' large network size and complexity, measuring all the swarms in the world requires many resources (PC servers, Internet connectivity, etc.). For this reason, existing work is forced to measure only a subset of the swarms and thus ends up understanding only a part of the whole. In this paper, we propose a resource-efficient method for crawling multiple BitTorrent swarms using only a limited amount of resources, such as a single PC server. In the proposed method, our crawler avoids collecting redundant swarm information, without straining WAN access links or expending many processing resources. We also use a number of techniques to efficiently crawl all the participating peers of multiple swarms. We crawl over 4.3 million unique .torrent files (small files that store metadata used in BitTorrent) and 48,000 tracker addresses. We can crawl 4.3 million swarms within an hour. We obtain 24 swarm snapshots and 10 million unique peers in a day.
  • Keywords
    information resources; peer-to-peer computing; wide area networks; P2P file sharing applications; WAN access links; crawling swarm information; multiple BitTorrent networks; resource-efficient method; Crawlers; Internet; Peer to peer computing; Redundancy; Servers; Web pages; BitTorrent; Network Measurement; P2P; Resource-Efficient Measurement;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Autonomous Decentralized Systems (ISADS), 2011 10th International Symposium on
  • Conference_Location
    Tokyo & Hiroshima
  • Print_ISBN
    978-1-61284-213-4
  • Type

    conf

  • DOI
    10.1109/ISADS.2011.72
  • Filename
    5741398