• DocumentCode
    2306353
  • Title

    An Approach for Word Categorization Based on Semantic Similarity Measure Obtained from Search Engines

  • Author

    Amasyali, M. Fatih

  • Author_Institution
    Bilgisayar Muhendisligi Bolumu, Yildiz Teknik Univ., Istanbul
  • fYear
    2006
  • fDate
    17-19 April 2006
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Word categorization based on semantic similarity is a problem need to be solved for several natural language applications. A similarity measure is need for word categorization. In this study it is proposed that the semantic similarity between two Turkish words is in direct proportion to the number of pages which the words are located next to each other. Google and Yahoo search engines were used to find the number of pages. In the first attempt to verify the proposal, the experiments were done with small datasets. The average success ratio is 87%.
  • Keywords
    classification; natural language processing; search engines; Google search engines; Turkish words; Yahoo search engines; natural language applications; semantic similarity measure; word categorization; Internet; Natural languages; Proposals; Search engines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications, 2006 IEEE 14th
  • Conference_Location
    Antalya
  • Print_ISBN
    1-4244-0238-7
  • Type

    conf

  • DOI
    10.1109/SIU.2006.1659840
  • Filename
    1659840