Title :
Converting unstructured and semi-structured data into knowledge
Author :
Rusu, Octavian ; Halcu, Ionela ; Grigoriu, O. ; Neculoiu, Giorgian ; Sandulescu, V. ; Marinescu, Mariana ; Marinescu, Virgil
Author_Institution :
Agency ARNIEC/RoEduNet, Alexandru Ioan Cuza Univ. Iasi, Iasi, Romania
Abstract :
With the rapid growth in number and dimension of databases and database applications in business, administrative, industrial and other fields, it is necessary to examine the automatic extraction of knowledge from these large databases. Due to knowledge extraction from databases, these have become rich and safe sources for generating and verification of knowledge, and the knowledge discovery can be applied in software management, querying process, making decisions, process control and many other fields of interest. At the same time, there is a challenge in managing unstructured data. Among organizations with large concentration of unstructured information, there is a greater tendency to devote more resources to this kind of data. The acquisition of knowledge from unstructured data is often difficult and expensive. Some possible solutions on extracting useful information (knowledge) from unstructured data are provided. Knowledge extraction is the process of creation of knowledge from structured, unstructured and semi-structured data. The objective of this paper is to present the possibilities of extracting knowledge from unstructured and semi-structured data particularly. The theories and tools for knowledge extraction are the subject of the emerging field of knowledge discovery in databases (KDD). Definitions of KDD are provided and the general multistep KDD process is outlined. A brief summary of recent KDD real-world applications is also provided. Finally, the article enumerates challenges for future research and development in KDD systems.
Keywords :
data handling; data mining; data structures; database management systems; KDD; administrative field; automatic knowledge extraction; business field; database application; decision making; industrial field; knowledge acquisition; knowledge discovery in databases; knowledge generation; knowledge verification; large database; process control; querying process; semistructured data conversion; software management; unstructured data conversion; unstructured data management; useful information extraction; Data mining; Data models; Databases; Knowledge discovery; Object oriented modeling; Organizations; KDD; knowledge; knowledge discovery; semi-structured data; unstructured data;
Conference_Titel :
Roedunet International Conference (RoEduNet), 2013 11th
Conference_Location :
Sinaia
Print_ISBN :
978-1-4673-6114-9
DOI :
10.1109/RoEduNet.2013.6511736