مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

612184

Title :

Taming the metadata mess

Author :

Megler, V.M. ; Maier, David

Author_Institution :

Dept. of Comput. Sci., Portland State Univ., Portland, OR, USA

fYear :

2013

fDate :

8-12 April 2013

Firstpage :

286

Lastpage :

289

Abstract :

The rapid growth of scientific data shows no sign of abating. This growth has led to a new problem: with so much scientific data at hand, stored in thousands of datasets, how can scientists find the datasets most relevant to their research interests? We have addressed this problem by adapting Information Retrieval techniques, developed for searching text documents, into the world of (primarily numeric) scientific data. We propose an approach that uses a blend of automated and “semi-curated” methods to extract metadata from large archives of scientific data, then evaluates ranked searches over this metadata. We describe a challenge identified during an implementation of our approach: the large and expanding list of environmental variables captured by the archive do not match the list of environmental variables in the minds of the scientists. We briefly characterize the problem and describe our initial thoughts on resolving it.

Keywords :

environmental factors; information retrieval; meta data; scientific information systems; environmental variables; information retrieval techniques; metadata extraction; metadata mess taming; research interest; scientific data; semicurated methods; text document searching; Conferences; Context; Databases; Prototypes; Search engines; Semantics;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Engineering Workshops (ICDEW), 2013 IEEE 29th International Conference on

Conference_Location :

Brisbane, QLD

Print_ISBN :

978-1-4673-5303-8

Electronic_ISBN :

978-1-4673-5302-1

Type :

conf

DOI :

10.1109/ICDEW.2013.6547465

Filename :

6547465

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=612184