DocumentCode
2744915
Title
A New Hybrid Farsi Text Summarization Technique Based on Term Co-Occurrence and Conceptual Property of the Text
Author
Zamanifar, Azadeh ; Minaei-Bidgoli, Behrouz ; Sharifi, Mohsen
Author_Institution
Fac. of Comput. Eng.,, Iran Univ. of Sci. & Technol., Tehran
fYear
2008
fDate
6-8 Aug. 2008
Firstpage
635
Lastpage
639
Abstract
The importance of text summarization grows rapidly as the amount of information increases exponentially. This paper presents a new hybrid summarization technique that combines statistical properties of documents with Farsi linguistic features. The originality of the technique lies on the use of term co-occurrence property of the text. It could detect the number of subjects. The proposed technique summarizes the document in proportion to the subject treated in a document. It considers the conceptual property of the text algorithm and based on word synonymy prevents similar sentences to be included in the summary. It also preserves the cohesion of the summarized text. Our results show better performance in comparison with FarsiSum, well known Farsi Summarizer, which is based only on the heuristic property of the text and do not consider the Farsi challenges.
Keywords
natural language processing; statistical analysis; text analysis; Farsi linguistic features; Farsi summarizer; conceptual property; documents; hybrid Farsi text summarization; statistical properties; summarized text; term cooccurrence property; Artificial intelligence; Computer networks; Concurrent computing; Data mining; Distributed computing; Humans; Natural languages; Software engineering; Spatial databases; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on
Conference_Location
Phuket
Print_ISBN
978-0-7695-3263-9
Type
conf
DOI
10.1109/SNPD.2008.57
Filename
4617444
Link To Document