• DocumentCode
    677220
  • Title

    Future trends in managing extracted information

  • Author

    Yafooz, Wael M. S. ; Abidin, Siti Z. Z. ; Omar, Normaliza ; Idrus, Zainura

  • Author_Institution
    Fac. of Comput. & Math. Sci., Univ. Teknol. MARA, Shah Alam, Malaysia
  • fYear
    2013
  • fDate
    Nov. 29 2013-Dec. 1 2013
  • Firstpage
    279
  • Lastpage
    283
  • Abstract
    Web technology is currently used in all daily activities and is considered a backbone of life. The amount of information continuously increases and grows, specifically that of unstructured information that has no rules or constraints. Such information is difficult to handle and thus requires organization and management before it can be useful. Information extraction techniques are efficient methods of converting unstructured documents into structured data. Attempts have been made to extract structured information that can be used with small amounts of textual data. However, for large amounts of data such as those found in the World Wide Web, the amount of extracted information is huge, and the relationships between extracted information are difficult to determine. Studies that focus on managing extracted information are few. In this paper, we present an overview of the recent studies on managing unstructured information, information extraction and managing extracted information. Managing extracted data using our proposed model for the rapid extraction and clustering of unstructured data for back-end applications in low-level of relational database systems is highlighted. This paper is intended for researchers interested in information extraction management and its applications.
  • Keywords
    Web sites; information retrieval; pattern clustering; relational databases; text analysis; Web technology; World Wide Web; backend applications; daily activity; information extraction management; relational database systems; structured information extraction; textual data; unstructured data clustering; unstructured documents; unstructured information management; Communities; Computers; Conferences; Data mining; Databases; Information retrieval; Semantics; extracted information management; information extraction; managing unstructured information; relational databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control System, Computing and Engineering (ICCSCE), 2013 IEEE International Conference on
  • Conference_Location
    Mindeb
  • Print_ISBN
    978-1-4799-1506-4
  • Type

    conf

  • DOI
    10.1109/ICCSCE.2013.6719974
  • Filename
    6719974