Title :
Applications of multilingual text retrieval
Author :
Croft, W. Bruce ; Broglio, John ; Fujii, Hideo
Author_Institution :
Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
Abstract :
The recent enormous increase in the use of networked information access and on-line databases has led to more databases being available in languages other than English. The Center for Intelligent Information Retrieval (CIIR) at the University of Massachusetts is involved in a variety of industrial government, and digital library applications which have a need for multilingual text retrieval. Most information retrieval research, however has been evaluated using English databases and queries, and relatively little is and own about how well advanced statistical techniques that incorporate ranking and term weight perform in different languages. We describe our experience with a range of projects involving text retrieval in Spanish, Japanese and Chinese. The issues covered by these projects include document representation techniques such as morphology and segmentation, query formulation and expansion techniques, relevance feedback and comparisons of retrieval effectiveness with English databases. The results indicate that advanced statistical techniques are effective in a wide range of languages, and that new languages can be incorporated with only moderate effort
Keywords :
government data processing; information retrieval; libraries; library automation; query formulation; relevance feedback; statistical analysis; Center for Intelligent Information Retrieval; Chinese text; English databases; Japanese text; Spanish text; digital library applications; document representation techniques; expansion techniques; government applications; industrial applications; information retrieval; morphology; multilingual text retrieval; networked information access; on-line databases; query formulation; ranking; relevance feedback; retrieval effectiveness; segmentation; term weight; Application software; Computer science; Databases; Encoding; Feedback; Government; Information retrieval; Natural languages; Software libraries; Telephony;
Conference_Titel :
System Sciences, 1996., Proceedings of the Twenty-Ninth Hawaii International Conference on ,
Conference_Location :
Wailea, HI
Print_ISBN :
0-8186-7324-9
DOI :
10.1109/HICSS.1996.495303