DocumentCode :
3231130
Title :
The role of gazetteers in geographic knowledge discovery on the Web
Author :
Souza, Ligiane A. ; Davis, Clodoveu A., Jr. ; Borges, Karla A V ; Delboni, Tiago M. ; Laender, Alberto H F
Author_Institution :
Departamento de Ciencia da Computacao, Univ. Fed. de Minas Gerais, Brazil
fYear :
2005
fDate :
31 Oct.-2 Nov. 2005
Abstract :
The Web is a large source of geographic information. Many Web documents have one or more spatial references, such as place names, addresses, zip codes or phone numbers. These spatial references are usually found in a semistructured fashion, which allows humans to identify and assign a geographic meaning to documents. In this paper, we discuss the important role that gazetteers, which are spatial catalogues of place names, can play in automating this process, and introduce the Locus gazetteer. Locus has been designed to hold not only place names for entities such as cities and rivers, but also to handle intra-urban place names, such as street names, urban landmarks, and postal addresses, along with their spatial relationships, through an ontology of places. We demonstrate that ontologically-enhanced gazetteers, such as Locus, are very useful for discovering the geographic context present on Web pages, and are often used in many other applications, such as in address geocoding for geographic information systems. To efficiently accomplish these tasks, the gazetteer must have a large database of spatial references; however, such a database is hard to obtain in emergent countries such as Brazil, in which available official geographic databases are limited and not well updated. As a way to tackle this problem, we describe a semi-automatic method used to populate the Locus gazetteer with geographic content extracted directly from the Web. To evaluate our work, an experiment was conducted, focusing on testing the Locus gazetteer data quality and comprehensiveness.
Keywords :
Internet; data mining; geographic information systems; search engines; visual databases; Locus gazetteer; Web document; Web page; geographic database; geographic information system; geographic knowledge discovery; intra-urban place name; ontologically-enhanced gazetteer; semiautomatic method; spatial reference database; Cities and towns; Data mining; Geographic Information Systems; Humans; Information retrieval; Ontologies; Rivers; Search engines; Spatial databases; Web pages;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Congress, 2005. LA-WEB 2005. Third Latin American
Print_ISBN :
0-7695-2471-0
Type :
conf
DOI :
10.1109/LAWEB.2005.38
Filename :
1592372