Title :
Web-based information access: multilingual automatic authoring
Author :
Basili, Roberto ; Pazienza, Maria Teresa ; Zanzotto, Fabio Massimo
Author_Institution :
Dept. of Comput. Sci., Syst. & Production, Rome Univ., Italy
Abstract :
The need for managing similar documents in different languages is increasing with the growing amount of electronic information that is available in documents of the same type (e.g. news streams). This paper proposes a viable approach to information access, emphasising the hypertextual paradigm in a multilingual framework. This task of processing/structuring text so that cross-lingual hypertext links are generated is called multilingual authoring (MA). Methods from natural language processing, especially information extraction, to both monolingual authoring and MA are described, and a general architecture for MA is defined. The effectiveness of the proposed approach is discussed using the description of the NAMIC (News Agencies Multilingual Information Categorisation) prototype system.
Keywords :
authoring systems; hypermedia; information resources; information retrieval; natural languages; text analysis; NAMIC system; News Agencies Multilingual Information Categorisation; World Wide Web-based information access; cross-lingual hypertext links; document languages; electronic information; information extraction; monolingual authoring; multilingual automatic authoring; natural language processing; news streams; similar documents; text processing; text structuring; Computer science; Data mining; Information retrieval; Large-scale systems; Multimedia systems; Natural language processing; Production systems; Prototypes; Space technology; Streaming media;
Conference_Titel :
Information Technology: Coding and Computing, 2002. Proceedings. International Conference on
Print_ISBN :
0-7695-1506-1
DOI :
10.1109/ITCC.2002.1000446