Title : 
Enhancing performance of Hadoop and MapReduce for scientific data using NoSQL database
         
        
            Author : 
Alshammari, Hamoud ; Bajwa, Hassan ; Jeongkyu Lee
         
        
            Author_Institution : 
Dept. of Comput. Sci., Univ. of Bridgeport, Bridgeport, CT, USA
         
        
        
        
        
        
            Abstract : 
Scientific data sets usually have similar jobs that are frequently applied to them by different users. In addition, many of these data sets are unstructured and complex, and required fast and simple processing. In order to increase the performance of the existing Hadoop and MapReduce algorithm, it is necessary to develop an algorithm based on the type of data sets and requirements of the jobs. Genomic and biological data is an example of unstructured data because it only has a huge sequence of unreadable and non-relational letters. In this paper, we present an overview of a developed MapReduce algorithm and its simulation using HBase as a NoSQL database.
         
        
            Keywords : 
parallel algorithms; relational databases; HBase; Hadoop algorithm; MapReduce algorithm; NoSQL database; biological data; genomic data; Bioinformatics; Biological cells; Computer architecture; Computer science; DNA; Databases; Genomics; BigData; HBase; Hadoop; MapReduce; NoSQL Database;
         
        
        
        
            Conference_Titel : 
Systems, Applications and Technology Conference (LISAT), 2015 IEEE Long Island
         
        
            Conference_Location : 
Farmingdale, NY
         
        
        
            DOI : 
10.1109/LISAT.2015.7160180