Title of article :
A Corpus based Approach to Find Similar Keywords for Search Engine Marketing
Author/Authors :
Siddiqui, Muazzam King Abdulaziz University - Faculty of Computing and Information Technology, Saudi Arabia , Fayoumi, Mohammad Umm Al-Qura University - Faculty of Computers and Information Systems, Saudi Arabia , Yusuf, Nidal Al-Isra University - Faculty of Information Technology, Jordan
From page :
460
To page :
466
Abstract :
Automatic thesaurus generation is used by search engines for query expansion. The same concept is used by search engine marketing companies to suggest keyword terms to their clients to improve the client’s ratings for different search engines. This paper presents and evaluates a corpus based method to find similar terms. The corpus is generated by scraping websites in different categories. A feature selection method is developed that rewards category specific terms and penalizes terms shared by two or more categories. The similarity measure is decomposed into three distinct components, namely contextual, functional and lexical similarities. The contextual similarity measure finds terms that are found in the same context. Functional similarity finds terms on co-occurrence basis while the lexically similar terms share one or more words. An overall similarity measure combines the evidence from these three measures
Keywords :
Information retrieval , text mining , term similarity , search engine marketing
Journal title :
The International Arab Journal of Information Technology (IAJIT)
Journal title :
The International Arab Journal of Information Technology (IAJIT)
Record number :
2544002
Link To Document :
بازگشت