DocumentCode :
1565569
Title :
Categorizing Fanatic Texts by Integrating Explanation Patterns with Na ï ve Bayes Classifier
Author :
Almonayyes, A.
Author_Institution :
Dept. of Math. & Comput. Sci., Kuwait Univ., Safat
Volume :
2
fYear :
2005
Firstpage :
1279
Lastpage :
1283
Abstract :
Exploratory data analysis over foreign language text presents virtually untapped opportunity. This work incorporates naive Bayes classifier with case-based reasoning in order to classify and analyze Arabic texts related to fanaticism. The Arabic vocabularies are converted to equivalent English words using conceptual hierarchy structure. The understanding process operates at two phases. At the first phase, a discrimination network of multiple questions is used to retrieve explanatory knowledge structures each of which gives an interpretation of a text according to a particular aspect of fanaticism. Explanation structures organize past documents of fanatic content. Similar documents are retrieved to generate additional valuable information about the new document. In the second phase, the document classification process based on naive Bayes is used to classify documents into their fanatic class. The results show that the classification accuracy is improved by incorporating the explanation patterns with the naive Bayes classifier
Keywords :
Bayes methods; case-based reasoning; data analysis; natural languages; pattern classification; text analysis; vocabulary; Arabic texts; case-based reasoning; conceptual hierarchy structure; document classification; explanation patterns; exploratory data analysis; fanatic texts categorization; foreign language text; naive Bayes classifier; Computer science; Data analysis; Data mining; Information analysis; Information retrieval; Machine learning; Mathematics; Natural languages; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9422-4
Type :
conf
DOI :
10.1109/ICNNB.2005.1614844
Filename :
1614844
Link To Document :
بازگشت