• DocumentCode
    3457489
  • Title

    Applications of multilingual text retrieval

  • Author

    Croft, W. Bruce ; Broglio, John ; Fujii, Hideo

  • Author_Institution
    Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
  • Volume
    5
  • fYear
    1996
  • fDate
    3-6 Jan 1996
  • Firstpage
    98
  • Abstract
    The recent enormous increase in the use of networked information access and on-line databases has led to more databases being available in languages other than English. The Center for Intelligent Information Retrieval (CIIR) at the University of Massachusetts is involved in a variety of industrial, government, and digital library applications which have a need for multilingual text retrieval. Most information retrieval research, however, has been evaluated using English databases and queries, and relatively little is known about how well advanced statistical techniques that incorporate ranking and term weighting perform in different languages. We describe our experience with a range of projects involving text retrieval in Spanish, Japanese and Chinese. The issues covered by these projects include document representation techniques such as morphology and segmentation, query formulation and expansion techniques, relevance feedback, and comparisons of retrieval effectiveness with English databases. The results indicate that advanced statistical techniques are effective in a wide range of languages, and that new languages can be incorporated with only moderate effort.
  • Keywords
    government data processing; information retrieval; libraries; library automation; query formulation; relevance feedback; statistical analysis; Center for Intelligent Information Retrieval; Chinese text; English databases; Japanese text; Spanish text; digital library applications; document representation techniques; expansion techniques; government applications; industrial applications; information retrieval; morphology; multilingual text retrieval; networked information access; on-line databases; query formulation; ranking; relevance feedback; retrieval effectiveness; segmentation; term weight; Application software; Computer science; Databases; Encoding; Feedback; Government; Information retrieval; Natural languages; Software libraries; Telephony
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences, 1996., Proceedings of the Twenty-Ninth Hawaii International Conference on
  • Conference_Location
    Wailea, HI
  • Print_ISBN
    0-8186-7324-9
  • Type

    conf

  • DOI
    10.1109/HICSS.1996.495303
  • Filename
    495303