Title :
Knowledge management and Cultural Heritage repositories: Cross-Lingual Information Retrieval strategies
Author :
di Buono, M.P. ; Monteleone, M. ; Marano, Francesco ; Monti, Johanna
Author_Institution :
Univ. of Salerno, Fisciano, Italy
fDate :
Oct. 28 2013-Nov. 1 2013
Abstract :
In recent years, important initiatives such as the development of the European Library and Europeana have aimed to increase the availability of cultural content from various types of providers and institutions. Access to these resources requires environments that both manage multilingual complexity and preserve semantic interoperability. Natural Language Processing (NLP) applications are developed with the aim of achieving Cross-Lingual Information Retrieval (CLIR). This paper presents ongoing research on language processing based on the Lexicon-Grammar (LG) approach, with the goal of improving knowledge management in Cultural Heritage repositories. The proposed framework aims to guarantee interoperability between multilingual systems in order to overcome crucial issues such as cross-language and cross-collection retrieval. Indeed, the LG methodology tries to overcome the shortcomings of statistical approaches, such as those used in Google Translate or Microsoft's Bing, in processing Multi-Word Units (MWUs) in queries, where the lack of linguistic context represents a serious obstacle to disambiguation. In particular, as has been widely recognized, translation in specific domains is unambiguous, since the meanings of terms are mono-referential and the relation that links a given term to its equivalent in a foreign language is biunivocal, i.e. a one-to-one coupling that makes this relation exclusive and reversible. Ontologies are used in CLIR and are considered by several scholars a promising research area for improving the effectiveness of Information Extraction (IE) techniques, particularly for technical-domain queries. Therefore, we present a methodological framework that allows both the data and the metadata to be mapped among the language-specific onto
Keywords :
history; knowledge management; meta data; natural language processing; ontologies (artificial intelligence); open systems; Bing; CLIR; English-Italian language pair; European Library; Europeana; Google Translate; IE techniques; LG approach; MWU processing; Microsoft; NLP applications; cross-lingual information retrieval strategies; cultural content; cultural heritage repositories; foreign language; information extraction techniques; institutions; knowledge management; language-specific ontologies; lexicon-grammar approach; metadata; methodological framework; multilingual systems; multiword unit processing; natural language processing applications; one-to-one coupling; providers; semantic interoperability; semantic search; statistical approaches; technical-domain queries; Compounds; Dictionaries; Grammar; Ontologies; Pragmatics; Resource description framework; Semantics; Cross-Lingual Information Retrieval; Knowledge Management; Ontologies;
Conference_Titel :
Digital Heritage International Congress (DigitalHeritage), 2013
Conference_Location :
Marseille
Print_ISBN :
978-1-4799-3168-2
DOI :
10.1109/DigitalHeritage.2013.6744767