• DocumentCode
    3529761
  • Title

    Application of Domain-Specific Search Method in Meta-Search Engine on Internet

  • Author

    Zheng Wang ; Qing Wang ; Dingwei Wang

  • Author_Institution
    Institute of System Engineering, Northeastern University, Shen Yang, 110004 China. Phone: +86-24-83677667, Fax: +86-24-83687760, E-mail: wangzhengdr@gmail.com
  • fYear
    2006
  • fDate
    Oct. 2006
  • Firstpage
    2078
  • Lastpage
    2085
  • Abstract
    With the Internet´s prosperity, more surfers find it is difficult to retrieve information efficiently. Domain-Specific Search Engine (DSSE) is one of solutions to alleviate such inefficient situation. This paper proposes a new method for building DSSE based on Meta-Search Engine (MSE) on Internet. First, Odds Ratio method is used to refine the domain-specific keywords, which are weighted by means of TF-IDF method. Second, domain query expression is constructed with the keywords by Decision Tree method to reflect the features of domain documents. Combining user´s plain query with the domain query expression, the MSE constructs a domain-specific query expression and submits it to General-purpose Search Engines (GSE). The Extended Boolean Model is employed to rank the search results back from the GSE. Finally, the ranked results are returned to the query user. The experiment results show that the proposed domain-specific search method is effective in searching domain documents. The designed DSSE is more simple and efficient than the dedicated domain-specific search engines.
  • Keywords
    Decision support systems; Decision trees; Electronic mail; Information retrieval; Internet; Metasearch; Search engines; Search methods; Systems engineering and theory; Web pages; decision tree; domain-specific search engine; extended boolean model; meta-search engine; odds ratio;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Engineering in Systems Applications, IMACS Multiconference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    7-302-13922-9
  • Electronic_ISBN
    7-900718-14-1
  • Type

    conf

  • DOI
    10.1109/CESA.2006.313656
  • Filename
    4105722