Title :
Efficiency of TR-Classifier versus TFIDF
Author :
Mourad Abbas;Kamel Smaïli;Daoud Berkani
Author_Institution :
Speech Process. Lab., crstdla, Algiers, Algeria
Abstract :
In this paper, we present a method of topic identification based on computing triggers pairs: TR-classifier (Triggers-based classifier). Indeed, it is used for the purpose to identify topics of texts. Hence, the first step to be realized is the construction of a vocabulary for each topic. Topic vocabularies are composed of words ranked according to their frequencies from the maximum to the minimum. We note that the size of each topic vocabulary is 400. For each word of the vocabulary, average mutual information (AMI) is calculated. The used triggers are selected according to the highest AMI values. In order to evaluate the TR Classifier, we compared it to the well-known TFIDF.
Keywords :
"Vocabulary","History","Mutual information","Adaptation model","Educational institutions","Speech","Computers"
Conference_Titel :
Integrated Intelligent Computing (ICIIC), 2010 First International Conference on
Print_ISBN :
978-1-4244-7963-4
DOI :
10.1109/ICIIC.2010.60