DocumentCode :
3741342
Title :
High speed streaming data analysis of web generated log streams
Author :
Sonali Agarwal;Bakshi Rohit Prasad
Author_Institution :
Indian Institute of Information Technology Allahabad, Uttar Pradesh, India
fYear :
2015
Firstpage :
413
Lastpage :
418
Abstract :
Web logs provide useful insight into large-scale web-based applications and are helpful in deriving web usage patterns. Because web usage patterns arrive at a high rate and in high volume, and are continuously updated in a real-time environment, they must be handled through modern big data architectures supported by powerful real-time big data processing tools. Web-generated log streams have the most significant impact when they can be analyzed at the time they are emitted. This work proposes an advanced stream analytics framework specifically for web-generated log streams, using a dataset of web access logs representing HTTP requests received by the NASA Kennedy Space Center server. The proposed framework can resourcefully handle the challenges of managing multiple web-based log streams that are distributed across a fleet of web-based applications, and it presents a summarized view of the statistical profile of those applications, which may be useful for web usage mining.
Keywords :
"Servers","Monitoring","Sparks","HTML","Uniform resource locators","Data mining","Queueing analysis"
Publisher :
IEEE
Conference_Titel :
Industrial and Information Systems (ICIIS), 2015 IEEE 10th International Conference on
Print_ISBN :
978-1-5090-1741-6
Type :
conf
DOI :
10.1109/ICIINFS.2015.7399047
Filename :
7399047