Title :
Future trends in managing extracted information
Author :
Yafooz, Wael M. S. ; Abidin, Siti Z. Z. ; Omar, Normaliza ; Idrus, Zainura
Author_Institution :
Fac. of Comput. & Math. Sci., Univ. Teknol. MARA, Shah Alam, Malaysia
fDate :
Nov. 29 2013-Dec. 1 2013
Abstract :
Web technology is currently used in all daily activities and is considered a backbone of life. The amount of information continuously increases and grows, specifically that of unstructured information that has no rules or constraints. Such information is difficult to handle and thus requires organization and management before it can be useful. Information extraction techniques are efficient methods of converting unstructured documents into structured data. Attempts have been made to extract structured information that can be used with small amounts of textual data. However, for large amounts of data such as those found in the World Wide Web, the amount of extracted information is huge, and the relationships between extracted information are difficult to determine. Studies that focus on managing extracted information are few. In this paper, we present an overview of the recent studies on managing unstructured information, information extraction and managing extracted information. Managing extracted data using our proposed model for the rapid extraction and clustering of unstructured data for back-end applications in low-level of relational database systems is highlighted. This paper is intended for researchers interested in information extraction management and its applications.
Keywords :
Web sites; information retrieval; pattern clustering; relational databases; text analysis; Web technology; World Wide Web; backend applications; daily activity; information extraction management; relational database systems; structured information extraction; textual data; unstructured data clustering; unstructured documents; unstructured information management; Communities; Computers; Conferences; Data mining; Databases; Information retrieval; Semantics; extracted information management; information extraction; managing unstructured information; relational databases;
Conference_Titel :
Control System, Computing and Engineering (ICCSCE), 2013 IEEE International Conference on
Conference_Location :
Mindeb
Print_ISBN :
978-1-4799-1506-4
DOI :
10.1109/ICCSCE.2013.6719974