Title :
Extending R Boxplot Analysis to Big Data in Education
Author :
Rajiv Pandey;Nidhi Srivastava;Shahnaz Fatima
Author_Institution :
Amity Inst. of Inf. Technol., Amity Univ., Lucknow, India
fDate :
4/1/2015 12:00:00 AM
Abstract :
Big Data is the buzz word doing rounds in all areas of human existence be medical, social networks, research, it has also made inroads to education. The large size and complexity of datasets in Big Data need specialized statistical tools for analysis where R can come handy. This paper explores the analysis of Big Data in education using a contemporary statistical tool R. R provides multiple dimensions to statistical analysis of dataset, this paper however explores only the Box Plot feature to study the impact of outliers on the overall summary measure of the dataset. The feature of trimmed mean is incorporated to demonstrate its impact on outliers. The trimmed data set can be used in predictive analysis for a business intelligence prediction or educational context.
Keywords :
"Big data","Education","Data mining","Data analysis","Complexity theory","Indexes","Real-time systems"
Conference_Titel :
Communication Systems and Network Technologies (CSNT), 2015 Fifth International Conference on
DOI :
10.1109/CSNT.2015.73