Title :
Framework for Data Mining of Big Data Using Probabilistic Grammars
Author :
Aljoharah Algwaiz;Reda Ammar;Sanguthevar Rajasekaran
Author_Institution :
Dept. of Comput. Sci. &
Abstract :
Most commonly used and researched data mining approaches use neural and statistical methods to extract information for large data sources. This paper proposes a framework for a novel approach in data mining using Probabilistic Context Free Grammar (PCFG). The framework is proposed for two distributed data structures, dependent and independent. Escherichia Coli promoter DNA sequences dataset are chosen as a case study in this paper.
Keywords :
"Grammar","Data mining","Probabilistic logic","DNA","Production","Distributed databases","Big data"
Conference_Titel :
e-Learning (econf), 2015 Fifth International Conference on
DOI :
10.1109/ECONF.2015.50