Title :
Web Site Management System through Private Information Extraction
Author :
Choi, Myung Sil ; Park, Yong Soo ; Ahn, Kwang Seon
Author_Institution :
Dept. of Comput. Eng., Kyungpook Nat. Univ., Daegu
Abstract :
This paper presents a management system for effectively extracting private information embedded in Web sites. To protect private information on a target Web site, all of it must first be collected, because potential information leaks have to be analyzed before appropriate protection measures can be devised. We use a crawling method to collect private information from the Web site. The Web site is then ordered and structured according to the collected information. To do this, we use a directed graph, treat Web documents as nodes, and assign a weight to documents containing private information, which reduces the time and cost of each crawl. We experimented with actual Web sites, demonstrating that our crawling method was superior in extracting and analyzing private information from Web sites.
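The graph-and-weight idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the regular expressions for "private information", the in-memory `SITE` dictionary, and the function names `private_weight`, `crawl`, and `recrawl_order` are all assumptions made for illustration, and no network access is performed.

```python
import re
from collections import deque

# Hypothetical patterns; the abstract does not say what counts as private information.
PRIVATE_PATTERNS = [
    re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),   # e-mail-like strings
    re.compile(r"\b\d{3}-\d{3,4}-\d{4}\b"),    # phone-like strings
]

def private_weight(text):
    """Count matches of the illustrative private-information patterns."""
    return sum(len(p.findall(text)) for p in PRIVATE_PATTERNS)

def crawl(pages, start):
    """Breadth-first crawl over an in-memory site.

    pages maps URL -> (text, list of linked URLs).
    Returns (graph, weights): directed edges and per-document weights.
    """
    graph, weights = {}, {}
    queue, seen = deque([start]), {start}
    while queue:
        url = queue.popleft()
        text, links = pages[url]
        # Keep only edges that point at documents inside the site.
        graph[url] = [link for link in links if link in pages]
        weights[url] = private_weight(text)
        for link in graph[url]:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return graph, weights

def recrawl_order(weights):
    """Order documents so that those with more private information come first."""
    return sorted(weights, key=lambda url: (-weights[url], url))

# Toy site used only for illustration.
SITE = {
    "/": ("Welcome", ["/staff", "/news"]),
    "/staff": ("Contact kim@example.org or 053-950-1234", ["/"]),
    "/news": ("No personal data here", ["/staff", "/missing"]),
}

graph, weights = crawl(SITE, "/")
order = recrawl_order(weights)
```

Sorting by weight is one plausible way to spend less time on later crawls, since documents known to hold private information are visited first; the paper may use a different prioritization.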
Keywords :
Web sites; directed graphs; information retrieval; search engines; Web site management system; crawling method; directed graph; private information extraction; Application software; Computer network management; Conference management; Crawlers; Data mining; Engineering management; Information analysis; Protection; Search engines; Uniform resource locators; Crawler; Private Information; Web Site;
Conference_Title :
Advanced Information Networking and Applications Workshops, 2009. WAINA '09. International Conference on
Conference_Location :
Bradford
Print_ISBN :
978-1-4244-3999-7
Electronic_ISBN :
978-0-7695-3639-2
DOI :
10.1109/WAINA.2009.114