DocumentCode
2448264
Title
Specialization of keyword extraction approach to Persian texts
Author
Khozani, Sayyid Mohammad Hoseini ; Bayat, Hosein
Author_Institution
Dept. of Comput. Sci., Islamic Azad Univ., Tafresh, Iran
fYear
2011
fDate
14-16 Oct. 2011
Firstpage
112
Lastpage
116
Abstract
As the amount of data increases and the relations among them get more complex, access to information implicit in data appears more difficult, and the role of methods of getting data from diverse texts, and analyzing them becomes more significant. Of such methods is the highly effective technique of keyword extraction which shows the concept and content of the original text. In this article, a new approach is presented with the aim of extracting keywords with respect to combined words, and extracting key sentences in Persian documents so as to classify them efficiently. Studies performed on several Persian documents, and comparisons done between the findings of these and other methods have proven that this method extracts keywords of texts with much more accuracy and speed to represent the original concepts.
Keywords
document handling; pattern classification; text analysis; word processing; Persian document classification; Persian text; keyword extraction; word extraction; Classification algorithms; Computers; Data mining; Educational institutions; Feature extraction; Vectors; Persian documents; classification; content; extraction; keywords;
fLanguage
English
Publisher
ieee
Conference_Titel
Soft Computing and Pattern Recognition (SoCPaR), 2011 International Conference of
Conference_Location
Dalian
Print_ISBN
978-1-4577-1195-4
Type
conf
DOI
10.1109/SoCPaR.2011.6089124
Filename
6089124
Link To Document