• DocumentCode
    249227
  • Title

    A performance comparison of scheduling distributed mining in cloud

  • Author

    Srikrishnan, V. ; Sivasankar, E. ; Pitchiah, R.

  • Author_Institution
    Centre for Dev. of Adv. Comput., Chennai, India
  • fYear
    2014
  • fDate
    19-20 Aug. 2014
  • Firstpage
    375
  • Lastpage
    379
  • Abstract
    Indexing play an indispensable role in Search Engine. Indexing empower ease of mining of data and lessen the latency of searching a term in huge documents. In this paper, we propose a methodology to index documents in a parallel - distributed manner. Define Metadata structure of a document for indexing; from the metadata, the occurrence of a word shall be ascertained by document wise, page number and up to Line number. This paper compares the average waiting time and average time of completion of indexing job of the three algorithms ( FIFO [First in First Out], SJF [Shortest Job First] and Lottery Scheduling) in the cloud environment. Later we propose an algorithm to reduce the average waiting time of indexing job. This methodology utilizes benefits of Cloud Computing, Virtualization, NoSQL and Distributed indexing.
  • Keywords
    cloud computing; data mining; document handling; indexing; meta data; scheduling; search engines; virtualisation; NoSQL; cloud computing; completion time; data mining; distributed indexing; distributed mining scheduling; document indexing; indexing job; line number; metadata structure; performance comparison; search engine; virtualization; waiting time; Indexing; Scheduling; Scheduling algorithms; Servers; Software; Virtual machining; Cloud Computing; Data Mining; NoSQL Scheduling Algorithm; Virtualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networks & Soft Computing (ICNSC), 2014 First International Conference on
  • Conference_Location
    Guntur
  • Print_ISBN
    978-1-4799-3485-0
  • Type

    conf

  • DOI
    10.1109/CNSC.2014.6906710
  • Filename
    6906710