• DocumentCode
    3705117
  • Title
    An improved approach to English-Hindi based Cross Language Information Retrieval system
  • Author
    Eva Katta;Anuja Arora
  • Author_Institution
    Department of CSE, Jaypee University (JIIT), Noida, India
  • fYear
    2015
  • Firstpage
    354
  • Lastpage
    359
  • Abstract
    Cross Language Information Retrieval (CLIR) is a subdomain of Information Retrieval. It deals with retrieving information in a language that is different from the language of the user's query. In this paper, an improved English-Hindi based CLIR is proposed. Several aspects of this broad research area have gone unnoticed and need further work in order to improve the performance of an English-Hindi based CLIR. Little research effort has gone into improving the searching and ranking aspects of CLIR systems, especially for English-Hindi based CLIR. This paper focuses on applying algorithms such as Naïve Bayes and particle swarm optimization to improve the ranking and searching aspects of a CLIR system. To make our system more efficient, we match terms contained in documents to the query terms in the same sequence in which they appear in the search query. Our approach also uses a bilingual English-Hindi translator to convert the query into Hindi. Further, we use Hindi query extension and synonym generation, which help retrieve more relevant results in an English-Hindi based CLIR than the existing approach. Together, these two techniques give the user a chance to choose a more appropriate Hindi query instead of relying on the single translated query, thereby improving overall performance.
  • Keywords
    "Clustering algorithms","Particle swarm optimization","Web sites","Algorithm design and analysis","Indexing","Dictionaries"
  • Publisher
    IEEE
  • Conference_Titel
    Contemporary Computing (IC3), 2015 Eighth International Conference on
  • Print_ISBN
    978-1-4673-7947-2
  • Type
    conf
  • DOI
    10.1109/IC3.2015.7346706
  • Filename
    7346706