DocumentCode
677220
Title
Future trends in managing extracted information
Author
Yafooz, Wael M. S. ; Abidin, Siti Z. Z. ; Omar, Normaliza ; Idrus, Zainura
Author_Institution
Fac. of Comput. & Math. Sci., Univ. Teknol. MARA, Shah Alam, Malaysia
fYear
2013
fDate
Nov. 29 2013-Dec. 1 2013
Firstpage
279
Lastpage
283
Abstract
Web technology is currently used in all daily activities and is considered a backbone of life. The amount of information continuously increases and grows, specifically that of unstructured information that has no rules or constraints. Such information is difficult to handle and thus requires organization and management before it can be useful. Information extraction techniques are efficient methods of converting unstructured documents into structured data. Attempts have been made to extract structured information that can be used with small amounts of textual data. However, for large amounts of data such as those found in the World Wide Web, the amount of extracted information is huge, and the relationships between extracted information are difficult to determine. Studies that focus on managing extracted information are few. In this paper, we present an overview of the recent studies on managing unstructured information, information extraction and managing extracted information. Managing extracted data using our proposed model for the rapid extraction and clustering of unstructured data for back-end applications in low-level of relational database systems is highlighted. This paper is intended for researchers interested in information extraction management and its applications.
Keywords
Web sites; information retrieval; pattern clustering; relational databases; text analysis; Web technology; World Wide Web; backend applications; daily activity; information extraction management; relational database systems; structured information extraction; textual data; unstructured data clustering; unstructured documents; unstructured information management; Communities; Computers; Conferences; Data mining; Databases; Information retrieval; Semantics; extracted information management; information extraction; managing unstructured information; relational databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Control System, Computing and Engineering (ICCSCE), 2013 IEEE International Conference on
Conference_Location
Mindeb
Print_ISBN
978-1-4799-1506-4
Type
conf
DOI
10.1109/ICCSCE.2013.6719974
Filename
6719974
Link To Document