• DocumentCode
    2114445
  • Title

    Converting unstructured and semi-structured data into knowledge

  • Author

    Rusu, Octavian ; Halcu, Ionela ; Grigoriu, O. ; Neculoiu, Giorgian ; Sandulescu, V. ; Marinescu, Mariana ; Marinescu, Virgil

  • Author_Institution
    Agency ARNIEC/RoEduNet, Alexandru Ioan Cuza Univ. Iasi, Iasi, Romania
  • fYear
    2013
  • fDate
    17-19 Jan. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    With the rapid growth in number and dimension of databases and database applications in business, administrative, industrial and other fields, it is necessary to examine the automatic extraction of knowledge from these large databases. Due to knowledge extraction from databases, these have become rich and safe sources for generating and verification of knowledge, and the knowledge discovery can be applied in software management, querying process, making decisions, process control and many other fields of interest. At the same time, there is a challenge in managing unstructured data. Among organizations with large concentration of unstructured information, there is a greater tendency to devote more resources to this kind of data. The acquisition of knowledge from unstructured data is often difficult and expensive. Some possible solutions on extracting useful information (knowledge) from unstructured data are provided. Knowledge extraction is the process of creation of knowledge from structured, unstructured and semi-structured data. The objective of this paper is to present the possibilities of extracting knowledge from unstructured and semi-structured data particularly. The theories and tools for knowledge extraction are the subject of the emerging field of knowledge discovery in databases (KDD). Definitions of KDD are provided and the general multistep KDD process is outlined. A brief summary of recent KDD real-world applications is also provided. Finally, the article enumerates challenges for future research and development in KDD systems.
  • Keywords
    data handling; data mining; data structures; database management systems; KDD; administrative field; automatic knowledge extraction; business field; database application; decision making; industrial field; knowledge acquisition; knowledge discovery in databases; knowledge generation; knowledge verification; large database; process control; querying process; semistructured data conversion; software management; unstructured data conversion; unstructured data management; useful information extraction; Data mining; Data models; Databases; Knowledge discovery; Object oriented modeling; Organizations; KDD; knowledge; knowledge discovery; semi-structured data; unstructured data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Roedunet International Conference (RoEduNet), 2013 11th
  • Conference_Location
    Sinaia
  • ISSN
    2068-1038
  • Print_ISBN
    978-1-4673-6114-9
  • Type

    conf

  • DOI
    10.1109/RoEduNet.2013.6511736
  • Filename
    6511736