Title :
An Efficient Algorithm for Hierarchical Classification of Protein and Gene Functions
Author :
Fabris, Fabio ; Freitas, Alex A.
Author_Institution :
Sch. of Comput., Univ. of Kent, Canterbury, UK
Abstract :
The classification of protein and gene functions is a complex problem that is becoming more relevant as the number of sequenced genes and proteins increases. This work presents a modified version of the Extended Local Hierarchical Naive Bayes algorithm, which exploits the requirements of the original algorithm (single-path, mandatory-leaf-prediction hierarchical classification problems in tree-structured class hierarchies) to greatly improve classification run-time. We show that, considering 18 hierarchical classification datasets, the modified algorithm yields equivalent predictive performance and significantly improves run-time in the training and prediction phases.
Keywords :
Bayes methods; biology computing; genetics; pattern classification; proteins; classification run-time; extended local hierarchical naive Bayes algorithm; gene function classification; hierarchical classification datasets; prediction phases; predictive performance; protein function classification; sequenced genes; single-path mandatory-leaf-prediction hierarchical classification problems; training phases; tree-structured class hierarchies; Bioinformatics; Data mining; Prediction algorithms; Probability; Proteins; Time complexity; Training;
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2014 25th International Workshop on
Conference_Location :
Munich
Print_ISBN :
978-1-4799-5721-7
DOI :
10.1109/DEXA.2014.29