DocumentCode :
2833113
Title :
Improvement of Supervised Isomap Algorithm and Its Application to Visualization and Categorization of Web Chinese Text
Author :
Jia, Tu ; Yi, Wu
Author_Institution :
Dept. of Math. & Syst. Sci., Nat. Univ. of Defense Technol., Changsha
fYear :
2008
fDate :
Aug. 29 2008-Sept. 2 2008
Firstpage :
157
Lastpage :
161
Abstract :
It is important to reduce the dimensionality of features in Web Chinese text categorization. Isomap algorithm is an unsupervised manifold learning technique. SIIsomap algorithm, an extension of Isomap to supervised feature extraction, is proposed in this paper. It uses adding constant method and a direct embedding technique of Isomap algorithm for testing data to make the embedding more reasonable and easier. SIIsomap algorithm is applied to visualization and classification experiments of Web Chinese text as a feature extraction method. In contrast with existed methods, it gets better visualization and classification effects and illustrates the effectiveness of our method.
Keywords :
Internet; data mining; data visualisation; feature extraction; pattern classification; text analysis; unsupervised learning; Web Chinese text; data visualization; direct embedding technique; feature extraction method; pattern classification; supervised feature extraction; supervised isomap algorithm; unsupervised manifold learning technique; Application software; Computer science; Data visualization; Feature extraction; Information technology; Kernel; Principal component analysis; Testing; Text categorization; Training data; Feature Extraction; Isomap; Supervised Isomap; Web Chinese Text Categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology, 2008. ICCSIT '08. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-0-7695-3308-7
Type :
conf
DOI :
10.1109/ICCSIT.2008.56
Filename :
4624852
Link To Document :
بازگشت