Title :
Text summarization features selection method using pseudo Genetic-based model
Author :
Abuobieda, Albaraa ; Salim, Naomie ; Albaham, Ameer Tawfik ; Osman, Ahmed Hamza ; Kumar, Yogan Jaya
Author_Institution :
Fac. of Comput. Sci. & Inf. Syst., Univ. Teknol. Malaysia, Skudai, Malaysia
Abstract :
The features are considered the cornerstone of text summarization. The most important issue is what feature to be considered in a text summarization process. Including all the features in the summarization process may not be considered as an optimal solution. Therefore, other methods need to be deployed. In this paper, random five features used and investigated using a (pseudo) Genetic concept as an optimized trainable features selection mechanism. The Document Understanding Conference (DUC2002) used to train our proposed model; hence the objective of this paper is to learn the weight (importance) of each used feature. For each input document using the genetic concept, the size of the generation is defined and the chromosome dimension (genes) is equal to number of features used. Each gene is represents a feature and in binary format. A chromosome with high fitness value is selected to be enrolled in the final round. The average of each gene is computed for all best chromosomes and considered the weight of that feature. Our experimental result shows that our proposed model is able performing features selection process.
Keywords :
data reduction; information retrieval; optimisation; statistical analysis; text analysis; Document Understanding Conference; binary format; chromosome dimension genes; feature weight; fitness value; optimal solutions; optimized trainable feature selection mechanism; pseudogenetic-based model; text summarization feature selection method; Biological cells; Computer science; Feature extraction; Genetics; Optimization; Probabilistic logic; Training; Features´ Weights; Genetic; Probabilistic; Sentence Scores; Similarity; Summarization; Text Features;
Conference_Titel :
Information Retrieval & Knowledge Management (CAMP), 2012 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4673-1091-8
DOI :
10.1109/InfRKM.2012.6204980