Title :
Extraction of User Profile Based on the Hadoop Framework
Author :
Huang, Lan ; Wang, Xiao-Wei ; Zhai, Yan-Dong ; Yang, Bin
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
Abstract :
With the rapid development of the Internet, the volume of Web information has increased dramatically, and users are often overwhelmed by it. Distributed processing of massive data on clusters composed of many machines, and personalized search services based on user profiles, have become hotspots of research and development. This paper first studies the operating mechanism of Hadoop, a typical distributed processing framework from Apache, then implements the extraction of user profiles from a large volume of Web log data, and verifies its efficiency through a comparison experiment against a single machine.
Keywords :
Internet; data mining; information retrieval; Apache distributed processing framework; Hadoop framework; Web information; Web log data; mass data distributed processing; personalized search service; user profile extraction; Data mining; Data processing; Distributed processing; Fault tolerance; File systems; Java; Logic; Parallel processing; Programming profession; Research and development;
Conference_Title :
Wireless Communications, Networking and Mobile Computing, 2009. WiCom '09. 5th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-3692-7
Electronic_ISBN :
978-1-4244-3693-4
DOI :
10.1109/WICOM.2009.5305856