Title :
Data mining and analysis in depth. case study of Qafqaz University HTTP server log analysis
Author :
Adamov, Abzetdin
Author_Institution :
Appl. Res. Center for Data Analytics & Web Insights (CeDAWI), Qafqaz Univ., Baku, Azerbaijan
Abstract :
The Internet Services, Web and Mobile Applications, Pervasive Communication widely available today meeting many of our needs and stimulating production of tremendous amounts of data. Over 90% of this information is unstructured, what means data does not have predefined structure and model. Generally, unstructured data is useless unless applying data mining or data extraction techniques. At the same time, just in case if we are able to process and understand data, this data worth anything, otherwise it becomes useless. Although, small part of this huge amount is structured (logs) or semi-structured (email, website), it is difficult to process and manage this data without advanced data analytics techniques. This paper provides an example of applying Data Mining and Analysis techniques on the data generated by HTTP Server Logs. Experimental results show that proposed analysis approach based on Regular Expressions is highly efficient and flexible. Results of such analysis are highly beneficial for any company which concerns about efficiency of their Internet-presence giving them important information based on the real data.
Keywords :
Web services; data analysis; data mining; mobile computing; Internet Services; Qafqaz University HTTP server log analysis; Web services; data analysis; data extraction techniques; data mining; mobile applications; pervasive communication; regular expressions; Browsers; Data collection; Data preprocessing; Web mining; Web servers; Data Analysis; Data Mining; Data Preprocessing; R programming; Web Mining; Web Usage Mining;
Conference_Titel :
Application of Information and Communication Technologies (AICT), 2014 IEEE 8th International Conference on
Conference_Location :
Astana
Print_ISBN :
978-1-4799-4120-9
DOI :
10.1109/ICAICT.2014.7035947