DocumentCode :
2167217
Title :
Text summarization features selection method using pseudo Genetic-based model
Author :
Abuobieda, Albaraa ; Salim, Naomie ; Albaham, Ameer Tawfik ; Osman, Ahmed Hamza ; Kumar, Yogan Jaya
Author_Institution :
Fac. of Comput. Sci. & Inf. Syst., Univ. Teknol. Malaysia, Skudai, Malaysia
fYear :
2012
fDate :
13-15 March 2012
Firstpage :
193
Lastpage :
197
Abstract :
The features are considered the cornerstone of text summarization. The most important issue is what feature to be considered in a text summarization process. Including all the features in the summarization process may not be considered as an optimal solution. Therefore, other methods need to be deployed. In this paper, random five features used and investigated using a (pseudo) Genetic concept as an optimized trainable features selection mechanism. The Document Understanding Conference (DUC2002) used to train our proposed model; hence the objective of this paper is to learn the weight (importance) of each used feature. For each input document using the genetic concept, the size of the generation is defined and the chromosome dimension (genes) is equal to number of features used. Each gene is represents a feature and in binary format. A chromosome with high fitness value is selected to be enrolled in the final round. The average of each gene is computed for all best chromosomes and considered the weight of that feature. Our experimental result shows that our proposed model is able performing features selection process.
Keywords :
data reduction; information retrieval; optimisation; statistical analysis; text analysis; Document Understanding Conference; binary format; chromosome dimension genes; feature weight; fitness value; optimal solutions; optimized trainable feature selection mechanism; pseudogenetic-based model; text summarization feature selection method; Biological cells; Computer science; Feature extraction; Genetics; Optimization; Probabilistic logic; Training; Features´ Weights; Genetic; Probabilistic; Sentence Scores; Similarity; Summarization; Text Features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Retrieval & Knowledge Management (CAMP), 2012 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4673-1091-8
Type :
conf
DOI :
10.1109/InfRKM.2012.6204980
Filename :
6204980
Link To Document :
بازگشت