DocumentCode
2050776
Title
Automated query generation Of Rdbms for information and knowledge extraction
Author
Arun, Abhishek ; Srinivasan, P.
Author_Institution
Dept. of CSE, Muthayammal Eng. Coll., Rasipuram, India
fYear
2013
fDate
21-22 Feb. 2013
Firstpage
468
Lastpage
473
Abstract
Information extraction systems traditionally implemented as a pipeline of special-purpose processing modules targeting the extraction of a particular kind of information. A major drawback is, whenever a new extraction goal emerges or a module is improved, extraction has to be reapplied from scratch to the entire text corpus even though a small part of corpus might be affected. By using database queries, information extraction enables the generic extraction and minimizes re-processing of data. Furthermore, this provides automated query generation components so that, casual users no need to learn the query language in order to perform extraction. To demonstrate the feasibility of our incremental extraction approach, experiments can be performed to highlight two important aspects of an information extraction system: efficiency and quality of extraction results. The existing extraction frameworks do not provide the capabilities of managing intermediate processed data such as parse trees and information. It is also extended to per-sentence extraction, it is important to notice that the query language itself is capable of defining patterns across multiple sentences. Hence, in order to provide nearest and good result the incremental approach is compared with the existing systems.
Keywords
data mining; database management systems; pipeline processing; query languages; query processing; text analysis; RDBMS; automated query generation components; data reprocessing minimization; database queries; extraction results efficiency; extraction results quality; incremental extraction approach; information extraction system; intermediate processed data management; knowledge extraction; per-sentence extraction; pipeline processing; query language; special-purpose processing modules; text corpus; text mining; Data mining; Database languages; Databases; Feature extraction; Information retrieval; Text processing; Writing; Text Mining; information storage and information retrieval; query languages;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Communication and Embedded Systems (ICICES), 2013 International Conference on
Conference_Location
Chennai
Print_ISBN
978-1-4673-5786-9
Type
conf
DOI
10.1109/ICICES.2013.6508210
Filename
6508210
Link To Document