Title :
Fuzzy feature analysis for unsupervised knowledge discovery in narrative texts
Author :
Perrin, Patrick ; Petry, Frederick
Author_Institution :
Dept. of Comput. Sci., Tulane Univ., New Orleans, LA, USA
Abstract :
This work demonstrates an application of fuzzy sets to the feature analysis problem to enhance the ability to find interesting knowledge in textual databases. We believe that the extraction of a satisfiable set of features yields to represent texts with a sufficiently structured and interesting system in a finite space leading to the discovery of interesting knowledge. We discuss an approach to extract a set of good features to effectively represent each text and best ensure that interesting knowledge can be found by applying any data mining technique. Feature analysis consists of automatically extracting context-dependent measurements to sense events described in narrative texts and transform them into a finite set of features maximizing the cohesion of the system (minimizing the entropy) with an acceptable level of interestingness. Preliminary experiments on real databases of dictated psychiatric evaluation notes were promising. Further experiments are in progress to further demonstrate the relationships between a situation´s structure and the level of guarantee to find interesting information
Keywords :
character recognition; fuzzy set theory; information retrieval; knowledge acquisition; minimum entropy methods; natural languages; unsupervised learning; visual databases; cohesion; context-dependent measurements; data mining technique; dictated psychiatric evaluation notes; fuzzy feature analysis; fuzzy sets; interestingness level; narrative texts; textual databases; unsupervised knowledge discovery; Computer science; Data mining; Deductive databases; Entropy; Extraterrestrial measurements; Extraterrestrial phenomena; Fuzzy sets; Intelligent systems; Knowledge based systems; Spatial databases;
Conference_Titel :
Fuzzy Systems, 1997., Proceedings of the Sixth IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
0-7803-3796-4
DOI :
10.1109/FUZZY.1997.619772