• DocumentCode
    3582133
  • Title

    Analysis of protein database for semantic similarity using map reduce — A survey

  • Author

    Nirmala, G. ; Dinakaran, K.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., RMD Eng. Coll., Chennai, India
  • fYear
    2014
  • Firstpage
    46
  • Lastpage
    50
  • Abstract
    Big data analytics is challenging for storage and analysis of large data sets. This paper is to determine the semantic relationships among Gene Ontology (GO) terms which are the attributes of protein database. This is implemented in a prototype system called PROSIM, which is an User interface with protein data collection. Gene ontology term is a collection of GO graph as an attribute in the database. A proposed technique that determine the contexts of terms based on the concept of existence dependency by map reduce in hadoop. Map Reduce is the parallel-processing engine that allows Hadoop to process and store the large data sets in relatively short order. Determining the semantic similarities among GO terms there by analysis and learning the affected proteins was done. The intensity and risk factor was concluded as per the classification.
  • Keywords
    Big Data; biology computing; data analysis; database management systems; ontologies (artificial intelligence); parallel processing; proteins; Big Data analytics; GO graph; GO term; Hadoop; MapReduce; PROSIM; gene ontology; protein data collection; protein database; semantic similarity; user interface; Big data; Biomedical measurement; Databases; Diseases; Ontologies; Proteins; Semantics; Gene ontology (GO); Map Reduce; data sets; related terms; semantic similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communication and Systems, 2014 International Conference on
  • Print_ISBN
    978-1-4799-3671-7
  • Type

    conf

  • DOI
    10.1109/ICCCS.2014.7068166
  • Filename
    7068166