Automated query generation Of Rdbms for information and knowledge extraction

Author

Arun, Abhishek ; Srinivasan, P.

Author_Institution

Dept. of CSE, Muthayammal Eng. Coll., Rasipuram, India

fYear

2013

fDate

21-22 Feb. 2013

Firstpage

468

Lastpage

473

Abstract

Information extraction systems traditionally implemented as a pipeline of special-purpose processing modules targeting the extraction of a particular kind of information. A major drawback is, whenever a new extraction goal emerges or a module is improved, extraction has to be reapplied from scratch to the entire text corpus even though a small part of corpus might be affected. By using database queries, information extraction enables the generic extraction and minimizes re-processing of data. Furthermore, this provides automated query generation components so that, casual users no need to learn the query language in order to perform extraction. To demonstrate the feasibility of our incremental extraction approach, experiments can be performed to highlight two important aspects of an information extraction system: efficiency and quality of extraction results. The existing extraction frameworks do not provide the capabilities of managing intermediate processed data such as parse trees and information. It is also extended to per-sentence extraction, it is important to notice that the query language itself is capable of defining patterns across multiple sentences. Hence, in order to provide nearest and good result the incremental approach is compared with the existing systems.

Keywords

data mining; database management systems; pipeline processing; query languages; query processing; text analysis; RDBMS; automated query generation components; data reprocessing minimization; database queries; extraction results efficiency; extraction results quality; incremental extraction approach; information extraction system; intermediate processed data management; knowledge extraction; per-sentence extraction; pipeline processing; query language; special-purpose processing modules; text corpus; text mining; Data mining; Database languages; Databases; Feature extraction; Information retrieval; Text processing; Writing; Text Mining; information storage and information retrieval; query languages;

fLanguage

English

Publisher

ieee

Conference_Titel

Information Communication and Embedded Systems (ICICES), 2013 International Conference on

Conference_Location

Chennai

Print_ISBN

978-1-4673-5786-9

Type

conf

DOI

10.1109/ICICES.2013.6508210

Filename

6508210