• DocumentCode
    1825758
  • Title

    Building a research library for the history of the web

  • Author

    Arms, William Y. ; Aya, Selcuk ; Dmitriev, Pavel ; Kot, Blazej J. ; Mitchell, Ruth ; Walle, Lucia

  • Author_Institution
    Comput. Sci. Dept., Cornell Univ., Ithaca, NY
  • fYear
    2006
  • fDate
    38869
  • Firstpage
    95
  • Lastpage
    102
  • Abstract
    This paper describes the building of a research library for studying the Web, especially research on how the structure and content of the Web change over time. The library is particularly aimed at supporting social scientists, for whom the Web is both a fascinating social phenomenon and a mirror on society. The library is built on the collections of the Internet Archive, which has been preserving a crawl of the Web every two months since 1996. The technical challenges in organizing this data for research fall into two categories: high-performance computing to transfer and manage the very large amounts of data, and human-computer interfaces that empower research by non-computer specialists.
  • Keywords
    Internet; human computer interaction; relational databases; research libraries; Internet Archive; human-computer interface; research library; social scientist; Arm; Buildings; Computer interfaces; Computer science; History; Information science; Internet; Mirrors; Organizing; Software libraries; computational social science; digital libraries; history of the web; internet archive
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Digital Libraries, 2006. JCDL '06. Proceedings of the 6th ACM/IEEE-CS Joint Conference on
  • Conference_Location
    Chapel Hill, NC
  • Print_ISBN
    1-59593-354-9
  • Type

    conf

  • DOI
    10.1145/1141753.1141771
  • Filename
    4119103