Title : 
Empirical validation of website quality using statistical and machine learning methods
         
        
            Author : 
Dhiman, Poonam ; Anjali
         
        
            Author_Institution : 
Dept. of Software Eng., Delhi Technol. Univ., New Delhi, India
         
        
        
        
        
        
            Abstract : 
The analysis of quantitative measure of large set of websites plays a significant role in evaluating the quality of websites. The paper, computes 22 metrics using a tool developed in MATLAB. Website quality prediction is developed using statistical and some machine learning methods. The work has been validated using dataset collected from webby awards web site. The results are analyzed using Area Under the Curve (AUC) obtained from Receiver Operating Characteristics (ROC) analysis. The results show that the model predicted using the random forest and Bayes Net methods outperformed over all the other models. Hence, based on these results it is reasonable to claim that quality models have a significant relevance with design metrics and the machine learning methods have a comparable performance with statistical methods. Univariate analysis results provide an empirical view for website design guidance and suggest which metrics are more important for website development.
         
        
            Keywords : 
Bayes methods; Web design; belief networks; learning (artificial intelligence); sensitivity analysis; statistical analysis; AUC analysis; Bayes net methods; MATLAB; ROC analysis; Website design guidance; Website development; Website quality evaluation; Website quality prediction; area under the curve analysis; empirical Website quality validation; machine learning methods; random forest; receiver operating characteristics analysis; statistical methods; univariate analysis; Analytical models; Awards activities; Learning systems; Measurement; Predictive models; Statistical analysis; Web pages; Empirical Validation; Machine Learning; Receiver Operating Characteristics; Statistical Methods; Web page;
         
        
        
        
            Conference_Titel : 
Confluence The Next Generation Information Technology Summit (Confluence), 2014 5th International Conference -
         
        
            Conference_Location : 
Noida
         
        
            Print_ISBN : 
978-1-4799-4237-4
         
        
        
            DOI : 
10.1109/CONFLUENCE.2014.6949363