DocumentCode :
249227
Title :
A performance comparison of scheduling distributed mining in cloud
Author :
Srikrishnan, V. ; Sivasankar, E. ; Pitchiah, R.
Author_Institution :
Centre for Dev. of Adv. Comput., Chennai, India
fYear :
2014
fDate :
19-20 Aug. 2014
Firstpage :
375
Lastpage :
379
Abstract :
Indexing play an indispensable role in Search Engine. Indexing empower ease of mining of data and lessen the latency of searching a term in huge documents. In this paper, we propose a methodology to index documents in a parallel - distributed manner. Define Metadata structure of a document for indexing; from the metadata, the occurrence of a word shall be ascertained by document wise, page number and up to Line number. This paper compares the average waiting time and average time of completion of indexing job of the three algorithms ( FIFO [First in First Out], SJF [Shortest Job First] and Lottery Scheduling) in the cloud environment. Later we propose an algorithm to reduce the average waiting time of indexing job. This methodology utilizes benefits of Cloud Computing, Virtualization, NoSQL and Distributed indexing.
Keywords :
cloud computing; data mining; document handling; indexing; meta data; scheduling; search engines; virtualisation; NoSQL; cloud computing; completion time; data mining; distributed indexing; distributed mining scheduling; document indexing; indexing job; line number; metadata structure; performance comparison; search engine; virtualization; waiting time; Indexing; Scheduling; Scheduling algorithms; Servers; Software; Virtual machining; Cloud Computing; Data Mining; NoSQL Scheduling Algorithm; Virtualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Networks & Soft Computing (ICNSC), 2014 First International Conference on
Conference_Location :
Guntur
Print_ISBN :
978-1-4799-3485-0
Type :
conf
DOI :
10.1109/CNSC.2014.6906710
Filename :
6906710
Link To Document :
بازگشت