Title :
Vector quantization, density estimation and outlier detection on cricket dataset
Author :
Parameswaran, K.
Author_Institution :
Dept. of Electron. & Telecommun. Eng., Univ. of Moratuwa, Moratuwa, Sri Lanka
Abstract :
This study aims to apply unsupervised machine learning algorithms on Cricket players´ career statistics dataset. K-means clustering algorithm is used to find the natural grouping that exists within the cricket players using player´s batting average, strike rate, bowling average, economy etc. as input features - in this case players are grouped into 3 groups. Further separate probability density models are fitted for batsmen, bowlers and all-rounding players using appropriate player´s performance metrics as input features and using these models, outstanding players are identified. Similar method is used to identify match winning players, where the differences between player´s performance metrics and team´s average performance metrics are used as input features. The results obtained from this study seem to correlate with expert generated results where they used point based system to rank the players. This kind of statistical analysis of sports data plays a vital role in team planning and exploiting opponents´ weakness.
Keywords :
estimation theory; pattern clustering; probability; sport; statistical analysis; unsupervised learning; vector quantisation; Cricket player career statistics dataset; all-rounding players; batsmen; bowlers; density estimation; input features; k-means clustering algorithm; match winning player identification; opponent weakness exploitation; outlier detection; player natural grouping; player performance metrics; player ranking; point-based system; probability density models; sports data statistical analysis; team average performance metrics; team planning; unsupervised machine learning algorithms; vector quantization; Biological system modeling; Clustering algorithms; Computers; Engineering profession; Estimation; Informatics; Measurement; Cricket; Density estimation; K-means clustering; Outlier detection;
Conference_Titel :
Computer Communication and Informatics (ICCCI), 2013 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4673-2906-4
DOI :
10.1109/ICCCI.2013.6466249