DocumentCode :
3777693
Title :
Extraction of latent concepts from an integrated human gene database: Non-negative matrix factorization for identification of hidden data structure
Author :
Katsuhiko Murakami
Author_Institution :
School of Bioscience and Biotechnology, Tokyo University of Technology, Tokyo, Japan
fYear :
2015
Firstpage :
346
Lastpage :
350
Abstract :
Information in genetic databases often describes complex concepts, such as diseases and gene functions having implicit relationships. However, such information is presented as independent concepts (for example, “genes” and “function”), making it difficult for the user, even specialists, to understand their meaning in relation to one another. This facilitates the need for extraction of hidden relationships among biological concepts, and for the addition of this information to databases. Therefore, we factorized a gene data matrix and extracted hidden relationships among both genes and their functional terms. We successfully identified composite concepts explained by plural genes and plural terms. This re-organization provides new insights for researchers and is helpful for interpretation of information.
Keywords :
"Databases","Gene expression","Proteins","Matrix decomposition","Data mining","DNA","Cost function"
Publisher :
ieee
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2015 7th International Conference of
Type :
conf
DOI :
10.1109/SOCPAR.2015.7492771
Filename :
7492771
Link To Document :
بازگشت