DocumentCode
478602
Title
Profile-Based Focused Crawler for Social Media-Sharing Websites
Author
Zhang, Zhiyong ; Nasraoui, Olfa
Volume
1
fYear
2008
fDate
3-5 Nov. 2008
Firstpage
317
Lastpage
324
Abstract
In this paper, we present a novel profile based focused crawling system for dealing with increasingly popular social media-sharing Web sites. In this system, we treat users´ profiles as ranking criteria for guiding the crawling process. Furthermore, we divide a user´s profile into two parts, an internal part, which comes from the user´s own contribution, and an external part, which comes from the user´s social contacts. In order to efficiently and effectively extract data from a social media-sharing website for focused crawling, a path string based page-classification method was first developed for identifying list pages, detail pages and profile pages.
Keywords
Web sites; pattern classification; social sciences computing; path string based page-classification method; profile based focused crawling system; social media-sharing Websites; Artificial intelligence; Computer science; Crawlers; Data mining; Learning systems; Search engines; Support vector machines; Taxonomy; Web sites; YouTube; focused crawl; profile; social;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2008. ICTAI '08. 20th IEEE International Conference on
Conference_Location
Dayton, OH
ISSN
1082-3409
Print_ISBN
978-0-7695-3440-4
Type
conf
DOI
10.1109/ICTAI.2008.119
Filename
4669706
Link To Document