DocumentCode
2306353
Title
An Approach for Word Categorization Based on Semantic Similarity Measure Obtained from Search Engines
Author
Amasyali, M. Fatih
Author_Institution
Bilgisayar Muhendisligi Bolumu, Yildiz Teknik Univ., Istanbul
fYear
2006
fDate
17-19 April 2006
Firstpage
1
Lastpage
4
Abstract
Word categorization based on semantic similarity is a problem need to be solved for several natural language applications. A similarity measure is need for word categorization. In this study it is proposed that the semantic similarity between two Turkish words is in direct proportion to the number of pages which the words are located next to each other. Google and Yahoo search engines were used to find the number of pages. In the first attempt to verify the proposal, the experiments were done with small datasets. The average success ratio is 87%.
Keywords
classification; natural language processing; search engines; Google search engines; Turkish words; Yahoo search engines; natural language applications; semantic similarity measure; word categorization; Internet; Natural languages; Proposals; Search engines;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Communications Applications, 2006 IEEE 14th
Conference_Location
Antalya
Print_ISBN
1-4244-0238-7
Type
conf
DOI
10.1109/SIU.2006.1659840
Filename
1659840
Link To Document