DocumentCode :
3501188
Title :
The realization of the distributed search engine on cloud platform
Author :
Li Ling ; Fu Yuan ; Ma Xiaozhen ; Zhang Hairong ; Zhang Yi
Author_Institution :
Inst. of Commun. Eng., Jilin Univ., Changchun, China
Volume :
01
fYear :
2013
fDate :
16-18 Aug. 2013
Firstpage :
691
Lastpage :
695
Abstract :
With the rapid development of the Internet and websites, the amount of web data is growing exponentially, which poses great challenges to traditional centralized search engines in real-time search, response speed, and the storage of massive numbers of pages. Cloud computing, which can integrate the resources of many PCs to provide distributed storage and parallel computing, has many advantages in dealing with massive data. A distributed search engine on a cloud platform is therefore proposed in this paper. It consists of three main modules: crawling, indexing, and retrieving. It also provides visual user interaction interfaces. The search engine is implemented on Hadoop, using the Hadoop Distributed File System (HDFS) for storage and the MapReduce parallel programming model to realize the indexing and retrieving functions. The search engine on Hadoop can store massive numbers of pages on many inexpensive machines, retrieve the wanted information from massive data, as shown in the function test, and reduce query time, as shown in the performance test, so it is economical, accurate, and efficient.
Keywords :
Web sites; cloud computing; distributed databases; indexing; parallel programming; query processing; search engines; storage management; user interfaces; HDFS; Hadoop distributed file system; Internet; Websites; centralized search engine; cloud computer; cloud platform; crawling module; distributed search engine; distributed storage; function test; indexing module; information retrieval; mass page storage; parallel computing; parallel programming model MapReduce; query time reduction; response speed; retrieving module; visual user interaction interfaces; Indexes; Lead; Optimized production technology; Cloud Platform; HDFS; MapReduce; Search Engine;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Measurement, Information and Control (ICMIC), 2013 International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4799-1390-9
Type :
conf
DOI :
10.1109/MIC.2013.6758056
Filename :
6758056
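Illustrative note :
The abstract states that indexing and retrieving are realized with MapReduce but gives no implementation details. The following is only a minimal single-process sketch of how an inverted index can be expressed as a map step and a reduce step; it is not the authors' code and does not use Hadoop. The function names (map_page, reduce_term, build_index, search), the whitespace tokenizer, and the boolean AND query semantics are all assumptions made for illustration.

```python
from collections import defaultdict


def map_page(url, text):
    """Map step (hypothetical): emit one (term, url) pair per distinct term."""
    for term in set(text.lower().split()):
        yield term, url


def reduce_term(term, urls):
    """Reduce step (hypothetical): merge a term's URLs into a sorted posting list."""
    return term, sorted(set(urls))


def build_index(pages):
    """Run map, group by key (the shuffle), then reduce, all in one process."""
    grouped = defaultdict(list)
    for url, text in pages.items():
        for term, page_url in map_page(url, text):
            grouped[term].append(page_url)
    return dict(reduce_term(term, urls) for term, urls in grouped.items())


def search(index, query):
    """Return the pages that contain every query term (assumed AND semantics)."""
    postings = [set(index.get(term, ())) for term in query.lower().split()]
    return sorted(set.intersection(*postings)) if postings else []
```

In a real Hadoop job the grouping done inside build_index would be handled by the framework's shuffle phase, and the pages would be read from HDFS rather than from an in-memory dictionary.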