Title :
Evaluating the performance of nonnegative matrix factorization for constructing semantic spaces: Comparison to latent semantic analysis
Author_Institution :
Dept. of Inf., Univ. of Electro-Commun., Chofu, Japan
Abstract :
This study examines the ability of nonnegative matrix factorization (NMF) as a method for constructing semantic spaces, in which the meaning of each word is represented by a high-dimensional vector. The performance of two tests (i.e., a multiple-choice synonym test and a word association test) is compared between NMF and latent semantic analysis (LSA), which is the most popular method for constructing semantic spaces. As a result, it was found that NMF did not outperform LSA in either test. This finding indicates that NMF is less effective in acquiring word meanings than expected in the literature; in other words, the finding provides evidence for the ability of LSA to represent semantic meanings. Some properties of NMF were also revealed with reference to its ability to represent word meanings; the random initialization was superior to the SVD-based initialization, and the Euclidean distance is more appropriate for the objective function of NMF than the KL-divergence. In addition, it was shown that the inner product was a more appropriate method for measuring the syntagmatic similarity in a semantic space model, while the cosine was a better method for computing the paradigmatic similarity.
Keywords :
matrix decomposition; natural language processing; string matching; word processing; Euclidean distance; KL divergence; SVD based initialization; high dimensional vector; latent semantic analysis; multiple choice synonym test; nonnegative matrix factorization; objective function; semantic meaning; semantic space construction; syntagmatic similarity; word association test; Accuracy; Matrix decomposition; Semantics; Latent semantic analysis; Nonnegative matrix factorization; Semantic space; Singular value decomposition; Syntagmatic and paradigmatic relations; Word association; Word meaning;
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-6586-6
DOI :
10.1109/ICSMC.2010.5641939