• DocumentCode
    3176353
  • Title

    Design and Implementation-Algorithms of Amharic Search Engine System for Amharic Web Contents

  • Author

    Redwan, Hassen ; Atnafu, Solomon

  • Author_Institution
    Dept. of Comput. Sci., Addis Ababa Univ., Addis Ababa, Ethiopia
  • fYear
    2009
  • fDate
    20-23 Dec. 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    On the Web, the use of languages other than English (e.g., Amharic language) has been growing exponentially. The number of Web documents in Amharic language as well as Internet users in Ethiopia is growing dramatically. However, the major search engines have been lagging behind in providing indexes, stemming and search features to handle this language. Therefore, the design and implementation of Web search engine that considers the typical characteristics of the Amharic language is needed. In this paper, we design Amharic Search Engine system for Amharic language Web documents and briefly discuss the algorithms for implementing the engine. The crawler, indexer and query engine are the basic components of this search engine. Typical characteristics of the Amharic language were considered by testing the engine for morphological variants as well as Amharic aliases support. For experimentation, two runs of the crawler were conducted by using 10 threads that crawl in parallel.
  • Keywords
    Internet; indexing; natural language processing; query processing; search engines; Amharic Web contents; Amharic alias support; Amharic language; Amharic search engine system; Ethiopia; Internet users; Web documents; crawler; indexer; query engine; Computer science; Crawlers; Electronic mail; Information resources; Information retrieval; Internet; Natural languages; Search engines; Web pages; Web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    New Technologies, Mobility and Security (NTMS), 2009 3rd International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-4765-7
  • Type

    conf

  • DOI
    10.1109/NTMS.2009.5384814
  • Filename
    5384814