DocumentCode
2889656
Title
Document Classification Via TextCC Based on Stereographic Projection
Author
Zhang, Zhen-ya ; Zhang, Shu-guang ; Wang, Xu-fa
Author_Institution
Microsoft Key Lab. of Multimedia Comput. & Commun., Univ. of Sci. & Technol. of China, Hefei
fYear
2006
fDate
13-16 Aug. 2006
Firstpage
1368
Lastpage
1372
Abstract
TextCC can classify real documents instantly by cosine similarity. In this paper, stereographic projection is defined from n dimensional real space to the surface of the unit sphere in (n+1) dimensional space. This paper also proposes the relation between the Euclidean distance in n dimensional space and the cosine similarity in (n+1) dimensional real space. To classify documents with represented vectors normalized by stereographic projection, modification on the construction of the weight matrix of hidden layer of TextCC and the fundamental for those modifications are presented. With those modifications, TextCC can classify real documents instantly by Euclidean distance. Experimental results show that TextCC can classify real documents well by Euclidean distance based on stereographic projection
Keywords
classification; learning (artificial intelligence); matrix algebra; text analysis; vectors; Euclidean distance; TextCC training; automatic text classification; cosine similarity; document classification; stereographic projection; vectors; weight matrix; Computer science; Cybernetics; Electronic mail; Euclidean distance; Feeds; Frequency; Laboratories; Machine learning; Multimedia computing; Neural networks; Text categorization; Vocabulary; Stereographic projection; TextCC; cosine similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2006 International Conference on
Conference_Location
Dalian, China
Print_ISBN
1-4244-0061-9
Type
conf
DOI
10.1109/ICMLC.2006.258706
Filename
4028277
Link To Document