DocumentCode :
3522890
Title :
Managing the knowledge contained in electronic documents: a clustering method for text mining
Author :
Iiritano, S. ; Ruffolo, M.
Author_Institution :
Getrronics SpA, Rende, Italy
fYear :
2001
fDate :
2001
Firstpage :
454
Lastpage :
458
Abstract :
The huge amount of unstructured data available on the Web and the intranets creates an information overloading problem. So, managing the knowledge contained in the textual documents is an important problem of Knowledge Management. Knowledge Extraction from collections of data is possible by Knowledge Discovery in Database (KDD), an interactive and iterative process focused on the exploration of data to discover new and interesting patterns within them. The fundamental phase of KDD process is Data Mining if data are in structured form and Text Mining when they are unstructured. This paper describes a prototype of a vertical corporate portal that implements a KDD process for knowledge extraction from unstructured data contained in textual documents. Text mining is realized through a clustering method that produces a partition of a set of documents on the basis of their contents characterized through the frequency of the words
Keywords :
data mining; information resources; intranets; pattern clustering; KDD; Web; clustering method; data mining; electronic documents; intranets; knowledge discovery in database; knowledge extraction; knowledge management; text mining; textual documents; unstructured data; vertical corporate portal; web portal; Clustering methods; Data mining; Databases; Frequency; Instruments; Knowledge management; Portals; Prototypes; Refining; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 2001. Proceedings. 12th International Workshop on
Conference_Location :
Munich
Print_ISBN :
0-7695-1230-5
Type :
conf
DOI :
10.1109/DEXA.2001.953103
Filename :
953103
Link To Document :
بازگشت