Title of article :
Building Semantic Kernel for Persian Text Classification with a Small Amount of Training Data
Author/Authors :
Jadidinejad، Amir H نويسنده Faculty of Computer and Information Technology Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran Jadidinejad, Amir H , Marza، Venus نويسنده Department of Computer Engineering, West Tehran Branch, Islamic Azad University, Tehran, Iran Marza, Venus
Issue Information :
فصلنامه با شماره پیاپی سال 2015
Pages :
12
From page :
125
To page :
136
Abstract :
The original idea of semantic kernels is to use semantic features instead of terms appeared in the text document. In this article, the documents are transformed into a new k-dimensional feature space by applying Singular Value Decomposition on the Term-Document matrix and extracting k eigenvectors with higher energy. The suggested semantic kernel causes severe reduction of dimensions which leads to two main conclusions. First, the computational complexity of the classifier is severely reduced. Second, the trained classifier has less sensitivity on the input terms; therefore, it can classify documents effectively. Experiments on Persian documents indicate the absolute superiority of the suggested semantic kernel in comparison to well-known vector space (Bag-of-Words) kernel, especially under the circumstances in which external semantic resources are not available and the amount of available training data is not sufficient
Journal title :
Journal of Advances in Computer Research
Serial Year :
2015
Journal title :
Journal of Advances in Computer Research
Record number :
1985206
Link To Document :
بازگشت