DocumentCode :
3289686
Title :
Arabic Topic Detection using automatic text summarisation
Author :
Koulali, Rim ; El-Haj, Mahmoud ; Meziane, Abdelkrim
Author_Institution :
LARI Lab., Mohammed I Univ., Oujda, Morocco
fYear :
2013
fDate :
27-30 May 2013
Firstpage :
1
Lastpage :
4
Abstract :
With the exponential growth of the online available Arabic documents, classifying and processing large Arabic corpora has became a challenging task. The presence of noisy information embedded in these documents has made it even more difficult to get accurate results when applying a Topic Detection (TD) process. To address this problem, a proper features selection approach is needed to enhance the topic detection accuracy. In this paper, we explore the impact of using automatic summarisation technique along with a feature-selection process to enhance Arabic Topic Detection. In our work we show that using automatic summarisation reduces noisy information and results in a significant enhancement to the topic detection process and therefore increases the performance of our TD system. This was achieved by the ability of our summariser system in reducing documents size to speed up the detection process.
Keywords :
natural language processing; text analysis; Arabic corpora; Arabic topic detection; TD process; TD system; automatic summarisation technique; automatic text summarisation; feature selection approach; feature-selection process; noisy information; online available Arabic documents; summariser system; Educational institutions; Electronic mail; Feature extraction; Natural language processing; Noise measurement; Training; Vectors; Automatic Summarisation; Cosine Similarity; Natural Language Processing; TF-IDF; Topic Detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications (AICCSA), 2013 ACS International Conference on
Conference_Location :
Ifrane
ISSN :
2161-5322
Type :
conf
DOI :
10.1109/AICCSA.2013.6616460
Filename :
6616460
Link To Document :
بازگشت