Title :
Automatic classification of cross-site scripting in web pages using document-based and URL-based features
Author :
Nunan, Angelo Eduardo ; Souto, Eduardo ; Santos, Eulanda M dos ; Feitosa, Eduardo
Author_Institution :
Inst. of Comput. (ICOMP), Fed. Univ. of Amazonas, Manaus, Brazil
Abstract :
The structure of dynamic websites comprised of a set of objects such as HTML tags, script functions, hyperlinks and advanced features in browsers lead to numerous resources and interactiveness in services currently provided on the Internet. However, these features have also increased security risks and attacks since they allow malicious codes injection or XSS (Cross-Site Scripting). XSS remains at the top of the lists of the greatest threats to web applications in recent years. This paper presents the experimental results obtained on XSS automatic classification in web pages using Machine Learning techniques. We focus on features extracted from web document content and URL. Our results demonstrate that the proposed features lead to highly accurate classification of malicious page.
Keywords :
Web sites; document handling; learning (artificial intelligence); pattern classification; security of data; HTML tag; Internet; URL-based feature; Web application; Web document content; Web page; Web site; XSS; XSS automatic classification; browser feature; cross-site scripting; cross-site scripting classification; document-based feature; hyperlink; machine learning technique; malicious codes injection; malicious page classification; script function; security attack; security risk; service interactiveness; service resource; Browsers; Databases; Encoding; Feature extraction; HTML; Support vector machines; Web pages; cross-site scripting; machine learning; scripting languages security; web application security;
Conference_Titel :
Computers and Communications (ISCC), 2012 IEEE Symposium on
Conference_Location :
Cappadocia
Print_ISBN :
978-1-4673-2712-1
Electronic_ISBN :
1530-1346
DOI :
10.1109/ISCC.2012.6249380