Title :
Minimum Information Loss Based Multi-kernel Learning for Flagellar Protein Recognition in Trypanosoma Brucei
Author :
Wang, Jim Jing-Yan ; Xin Gao
Author_Institution :
Electr. & Math. Sci. & Eng. Div., King Abdullah Univ. of Sci. & Technol. (KAUST), Thuwal, Saudi Arabia
Abstract :
Trypanosma brucei (T. Brucei) is an important pathogen agent of African trypanosomiasis. The flagellum is an essential and multifunctional organelle of T. Brucei, thus it is very important to recognize the flagellar proteins from T. Brucei proteins for the purposes of both biological research and drug design. In this paper, we investigate computationally recognizing flagellar proteins in T. Brucei by pattern recognition methods. It is argued that an optimal decision function can be obtained as the difference of probability functions of flagella protein and the non-flagellar protein for the purpose of flagella protein recognition. We propose to learn a multi-kernel classification function to approximate this optimal decision function, by minimizing the information loss of such approximation which is measured by the Kull back - Leibler (KL) divergence. An iterative multi-kernel classifier learning algorithm is developed to minimize the KL divergence for the problem of T. Brucei flagella protein recognition, experiments show its advantage over other T. Brucei flagellar protein recognition and multi-kernel learning methods.
Keywords :
approximation theory; cellular biophysics; diseases; drugs; iterative methods; learning (artificial intelligence); medical computing; microorganisms; molecular biophysics; pattern classification; probability; proteins; African trypanosomiasis; KL divergence; Kullback Leibler divergence; T. brucei proteins; Trypanosome brucei; approximation; biological research; drug design; flagellar protein recognition; iterative multikernel classifier learning algorithm; minimum information loss based multikernel learning; multifunctional organelle; multikernel classification function; nonflagellar protein; optimal decision function; pathogen agent; pattern recognition methods; probability functions; Approximation methods; Feature extraction; Kernel; Loss measurement; Proteins; Training; Vectors; Flagellar protein; Information Loss; Kullback Leibler divergence; Multi-Kernel Learning; Trypanosma brucei;
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
DOI :
10.1109/ICDMW.2014.142