DocumentCode :
2050776
Title :
Automated query generation Of Rdbms for information and knowledge extraction
Author :
Arun, Abhishek ; Srinivasan, P.
Author_Institution :
Dept. of CSE, Muthayammal Eng. Coll., Rasipuram, India
fYear :
2013
fDate :
21-22 Feb. 2013
Firstpage :
468
Lastpage :
473
Abstract :
Information extraction systems traditionally implemented as a pipeline of special-purpose processing modules targeting the extraction of a particular kind of information. A major drawback is, whenever a new extraction goal emerges or a module is improved, extraction has to be reapplied from scratch to the entire text corpus even though a small part of corpus might be affected. By using database queries, information extraction enables the generic extraction and minimizes re-processing of data. Furthermore, this provides automated query generation components so that, casual users no need to learn the query language in order to perform extraction. To demonstrate the feasibility of our incremental extraction approach, experiments can be performed to highlight two important aspects of an information extraction system: efficiency and quality of extraction results. The existing extraction frameworks do not provide the capabilities of managing intermediate processed data such as parse trees and information. It is also extended to per-sentence extraction, it is important to notice that the query language itself is capable of defining patterns across multiple sentences. Hence, in order to provide nearest and good result the incremental approach is compared with the existing systems.
Keywords :
data mining; database management systems; pipeline processing; query languages; query processing; text analysis; RDBMS; automated query generation components; data reprocessing minimization; database queries; extraction results efficiency; extraction results quality; incremental extraction approach; information extraction system; intermediate processed data management; knowledge extraction; per-sentence extraction; pipeline processing; query language; special-purpose processing modules; text corpus; text mining; Data mining; Database languages; Databases; Feature extraction; Information retrieval; Text processing; Writing; Text Mining; information storage and information retrieval; query languages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Communication and Embedded Systems (ICICES), 2013 International Conference on
Conference_Location :
Chennai
Print_ISBN :
978-1-4673-5786-9
Type :
conf
DOI :
10.1109/ICICES.2013.6508210
Filename :
6508210
Link To Document :
بازگشت