DocumentCode :
707442
Title :
Application of data mining techniques to study the sex ratio distortion in crossbred dairy cattle
Author :
Raja, T.V. ; Ruhil, A.P. ; Gandhi, R.S.
Author_Institution :
Nat. Dairy Res. Inst., Karnal, India
fYear :
2015
fDate :
11-13 March 2015
Firstpage :
1172
Lastpage :
1175
Abstract :
The present investigation was undertaken to find the efficiency of three data mining techniques to study the sex ratio distortion in crossbred dairy cattle. Information on the date of birth, sex of calf, sire and dam identification number, parity number of dam and date of conception of dam of 873 crossbred calves born to 89 bulls at Cattle Breeding Farm, Thumburmuzhi, Kerala for the period from 1991 to 2005 were utilized for the present study. The three classificatory techniques viz., logistic regression, decision tree and artificial neural networks were employed using the Weka (Witten and Frank, 2005) software. Confusion matrix was developed by each classifier and evaluation of the technique was done by using parameters viz. kappa statistic, mean absolute error, root mean squared error, relative absolute error and root relative squared error. The results revealed that classification by logistic regression was the best as it had maximum (58.72) percentage of correctly classified instances; while the Decision tree (52.75) and artificial neural network (52.29) classifiers had almost similar efficiency. Based on the results obtained in the present study it may be concluded that the sex of calf can be classified using various attributes at an accuracy of 58.70 per cent. The decision tree understands and classifies the problem at a faster rate than the logistic regression and artificial neural network techniques. However, the logistic regression classifies the sex of the calf at a more accurate and better way than the other two classification techniques. Hence among the three techniques used the logistic regression was recommended for the classification of sex of calf in dairy cattle.
Keywords :
dairying; data mining; Weka software; artificial neural networks; classifiers; confusion matrix; crossbred dairy cattle; dam identification number; data mining techniques; decision tree; kappa statistic; logistic regression; mean absolute error; relative absolute error; root mean squared error; root relative squared error; sex ratio distortion; Decision support systems; Economic indicators; Handheld computers; Crossbred cattle; Data mining; Decision tree; Logistic regression; Neural network; Sex ratio;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing for Sustainable Global Development (INDIACom), 2015 2nd International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-9-3805-4415-1
Type :
conf
Filename :
7100432
Link To Document :
بازگشت