Title :
An Approach for Word Categorization Based on Semantic Similarity Measure Obtained from Search Engines
Author :
Amasyali, M. Fatih
Author_Institution :
Bilgisayar Muhendisligi Bolumu, Yildiz Teknik Univ., Istanbul
Abstract :
Word categorization based on semantic similarity is a problem need to be solved for several natural language applications. A similarity measure is need for word categorization. In this study it is proposed that the semantic similarity between two Turkish words is in direct proportion to the number of pages which the words are located next to each other. Google and Yahoo search engines were used to find the number of pages. In the first attempt to verify the proposal, the experiments were done with small datasets. The average success ratio is 87%.
Keywords :
classification; natural language processing; search engines; Google search engines; Turkish words; Yahoo search engines; natural language applications; semantic similarity measure; word categorization; Internet; Natural languages; Proposals; Search engines;
Conference_Titel :
Signal Processing and Communications Applications, 2006 IEEE 14th
Conference_Location :
Antalya
Print_ISBN :
1-4244-0238-7
DOI :
10.1109/SIU.2006.1659840