DocumentCode :
3762270
Title :
A Novel Data Mining Approach for Multi Variant Text Classification
Author :
Kevin Joy Dsouza;Zaheed Ahmed Ansari
Author_Institution :
Dept. of CSE, Srinivas Inst. of Technol., Mangalore, India
fYear :
2015
Firstpage :
68
Lastpage :
73
Abstract :
Text classification, which aims to assign a document to one or more categories based on its content, is a fundamental task for Web and/or document data mining applications. In natural language processing and information extraction fields Text classification is emerging as an important part, were we can use this approach to discover useful information from large database. These approaches allow individuals to construct classifiers that have relevance for a variety of domains. Existing algorithms such as Svm Light have less GUI support and take more time to perform classification task. In this presented work classification of multi-domain documents is performed by using weka-LibSVM classifier. Here to transform collected training set and test set documents into term-document matrix (TDM), the vector space model is used. In classifier TDM is used to generate predicted results. The results emerged from weka with its GUI support using TDM have quick response time in classifying the documents.
Keywords :
"Text categorization","Semantics","Training","Time division multiplexing","Feature extraction","Filtering"
Publisher :
ieee
Conference_Titel :
Cloud Computing in Emerging Markets (CCEM), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/CCEM.2015.11
Filename :
7436933
Link To Document :
بازگشت