• DocumentCode
    2050776
  • Title

    Automated query generation Of Rdbms for information and knowledge extraction

  • Author

    Arun, Abhishek ; Srinivasan, P.

  • Author_Institution
    Dept. of CSE, Muthayammal Eng. Coll., Rasipuram, India
  • fYear
    2013
  • fDate
    21-22 Feb. 2013
  • Firstpage
    468
  • Lastpage
    473
  • Abstract
    Information extraction systems traditionally implemented as a pipeline of special-purpose processing modules targeting the extraction of a particular kind of information. A major drawback is, whenever a new extraction goal emerges or a module is improved, extraction has to be reapplied from scratch to the entire text corpus even though a small part of corpus might be affected. By using database queries, information extraction enables the generic extraction and minimizes re-processing of data. Furthermore, this provides automated query generation components so that, casual users no need to learn the query language in order to perform extraction. To demonstrate the feasibility of our incremental extraction approach, experiments can be performed to highlight two important aspects of an information extraction system: efficiency and quality of extraction results. The existing extraction frameworks do not provide the capabilities of managing intermediate processed data such as parse trees and information. It is also extended to per-sentence extraction, it is important to notice that the query language itself is capable of defining patterns across multiple sentences. Hence, in order to provide nearest and good result the incremental approach is compared with the existing systems.
  • Keywords
    data mining; database management systems; pipeline processing; query languages; query processing; text analysis; RDBMS; automated query generation components; data reprocessing minimization; database queries; extraction results efficiency; extraction results quality; incremental extraction approach; information extraction system; intermediate processed data management; knowledge extraction; per-sentence extraction; pipeline processing; query language; special-purpose processing modules; text corpus; text mining; Data mining; Database languages; Databases; Feature extraction; Information retrieval; Text processing; Writing; Text Mining; information storage and information retrieval; query languages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Communication and Embedded Systems (ICICES), 2013 International Conference on
  • Conference_Location
    Chennai
  • Print_ISBN
    978-1-4673-5786-9
  • Type

    conf

  • DOI
    10.1109/ICICES.2013.6508210
  • Filename
    6508210