• DocumentCode
    650642
  • Title

    Tape Cloud: Scalable and Cost Efficient Big Data Infrastructure for Cloud Computing

  • Author

    Prakash, Varun S. ; Wen, Yuanfeng ; Shi, Weidong

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Houston, Houston, TX, USA
  • fYear
    2013
  • fDate
    June 28 2013-July 3 2013
  • Firstpage
    541
  • Lastpage
    548
  • Abstract
    Magnetic tape has long been a primary backup storage medium in many organizations. In this paper, we evaluate the possibility of establishing an inter-network accessible, centralized, tape-based data backup facility. Our aim is to develop a cloud storage service that organizations can use for long-term storage of big data, which is typically Write-Once-Read-Many. This Infrastructure-as-a-Service (IaaS) cloud can provide much-needed cost effectiveness in storing huge amounts of data, exempting client organizations from high infrastructure investments. We attempt to understand some of the limitations induced by the use of tapes by studying the latency of tape libraries, in comparison to their hard disk counterparts, in the scenarios most likely to arise during the backup process. The result of this study is an outline of methods to overcome these limitations: adopting novel tape storage architectures, file systems, and schedulers to manage data transaction requests from various clients, and developing faster ways to retrieve requested data so that applications extend beyond backup. We use commercially available tapes and a tape library to perform latency tests and understand the basic operations of tape. Backed by statistics that suggest extensive use of tapes to this day and in the future, we propose an architecture to provide data backup to a large and diverse client base.
  • Keywords
    back-up procedures; cloud computing; hard discs; magnetic tape storage; scheduling; storage management; very large databases; IaaS cloud; backing up process; backup storage; big data infrastructure; client organizations; cloud computing; cloud storage service; cost effectiveness; data storage; data transaction requests management; file system; hard disk; infrastructure investments; infrastructure-as-a-service cloud; inter-network accessible-centralized tape based data backup facility; latency tests; long term storage; magnetic tapes; requested data retrieval; schedulers; tape cloud; tape libraries latency; tape storage architectures; write-once-read-many; Cloud computing; Hard disks; Hardware; Libraries; Media; Organizations; Servers; Big Data; Cloud; Cost Efficient; Storage; Tape;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cloud Computing (CLOUD), 2013 IEEE Sixth International Conference on
  • Conference_Location
    Santa Clara, CA
  • Print_ISBN
    978-0-7695-5028-2
  • Type

    conf

  • DOI
    10.1109/CLOUD.2013.129
  • Filename
    6676738