Title :
HCRPDB: a database retrieval & mining system for human cell receptors proteins
Author :
Choudhari, Ajay ; Gangadhar, S. ; Agarwal, Sankalp
Author_Institution :
Indian Inst. of Inf. Technol., Uttar Pradesh, India
Abstract :
The information on cell receptors is an important base for understanding living systems, diseases and for designing new drugs. The human cell receptor protein database (HCRPDB; http://profile.iiita.ac.in/achoudhary_01/HCRPDB/index.htm) is an integrated relational repository of human cell receptor protein sequences and data mining system. It stores almost every receptor information belonging to human. An algorithm was developed to facilitate the automatic parsing of large number of protein sequences and related information from Gen Bank, SwissProt, PIR, EMBL, DBJ and PDB. The parser was made using XML (Extensible Markup Language), Java programming and JDOM (Java Document Object Model) library. It extracts information relevant to HCRPDB from the above mentioned databanks. The extracted information is then correlated with the metadata of HCRPDB knowledge base to encode extracted unstructured text into the structured format based on design of entity attribute value with classes and relationship.
Keywords :
Java; XML; biology computing; data mining; entity-relationship modelling; grammars; human computer interaction; information retrieval; knowledge based systems; meta data; program compilers; proteins; relational databases; scientific information systems; software libraries; Extensible Markup Language; HCRPDB knowledge base; JDOM; Java Document Object Model library; Java programming; XML; automatic parsing; data mining system; database retrieval; entity attribute value; human cell receptor protein database; information extraction; integrated relational repository; metadata; user-friendly retrieval; Data mining; Diseases; Drugs; Humans; Information retrieval; Java; Libraries; Protein engineering; Relational databases; XML;
Conference_Titel :
Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on
Print_ISBN :
0-7803-8840-2
DOI :
10.1109/ICISIP.2005.1529505