DocumentCode :
2251083
Title :
Mining Patterns with Domain Knowledge: A Case Study on Multi-language Data
Author :
Antunes, Cláudia ; Bebiano, Tiago
Author_Institution :
Dept. of Comput. Sci. & Eng., Inst. Super. Tecnico, Lisbon, Portugal
fYear :
2012
fDate :
May 30 2012-June 1 2012
Firstpage :
167
Lastpage :
172
Abstract :
Multi-language data impairs the application of mining techniques in a generalized form, since language remains an impenetrable barrier. The advances on domain driven data mining and the study of its semantic aspects open a first window over it, in particular the D2PM framework [1]. This paper proposes a new method for mining patterns over multi-language data, through the use of the D2FP-Growth algorithm and a language constraint, both defined in the context of the referred framework. The new constraint allows for interpreting a word by its meaning and consequently to overcome language differences.
Keywords :
data mining; ontologies (artificial intelligence); D2FP-growth algorithm; D2PM framework; domain driven data mining; domain knowledge; language constraint; language differences; multilanguage data; ontology; pattern mining; semantic aspects; Abstracts; Context; Data mining; Knowledge based systems; Lungs; Ontologies; Positron emission tomography; Domain Driven Pattern Mining; Multi-language; Ontology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Science (ICIS), 2012 IEEE/ACIS 11th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-1536-4
Type :
conf
DOI :
10.1109/ICIS.2012.70
Filename :
6211092
Link To Document :
بازگشت