Title :
DIFTSAS: A DIstributed Full Text Search and Analysis System for Big Data
Author :
Bo Li ; Jingjie Zhang ; Mingyu Chen ; Jinchao Zhang ; Kunpeng Wang ; Dan Meng
Author_Institution :
Nat. Eng. Lab. for Inf. Security Technol., Inst. of Inf. Eng., Beijing, China
Abstract :
As we enter the big data era, digital world is growing fast and becomes more and more complex in nature, many content-oriented applications require both features of high performance full-text searching and rich functional online data analyzing. In this paper, we exploit the features of both web search and parallel DBMSs to address the challenges to build a scalable, interactive ad-hoc query system for analysis of incrementally accumulated content data, namely DIFTSAS. DIFTSAS is considered as a many-sided system, which is not only capable of handling full-text data as a search engine but also adept at online data analytical processing like a traditional data warehouse. In this paper, we present the design, implementation, and performance evaluation of DIFTSAS.
Keywords :
data analysis; query processing; search engines; text analysis; DIFTSAS; big data; content-oriented application; distributed full text search and analysis system; high performance full-text searching; incrementally accumulated content data analysis; interactive ad-hoc query system; rich functional online data analysis; search engine; Conferences; Scientific computing; data warehouse; distributed system; full text; search engine;
Conference_Titel :
Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/CSE.2013.193