DocumentCode :
1615797
Title :
Structured Poi data extraction from Internet news
Author :
Zhang, Hua-Ping ; Mo, Qian ; Huang, He-Yan
Author_Institution :
Beijing Inst. of Technol. (BIT), Beijing, China
fYear :
2010
Firstpage :
116
Lastpage :
122
Abstract :
POI (Point of Interest)database is key resources for GIS (Geographic Information System) application. POI manual gathering is expensive and time consuming. This paper presents a state-of-the-art solution that automatically extracts structured POI data from Internet news. The procedure includes making lexical analysis news Internet, and then identifying time expression, location and organization entities, extracting an event scenario based on POI heuristic features. With POI data extraction, consistency between event and entity, result optimization and filtering with heuristics was taken into account. Open testing with experiment conducted on 1,000 news, the precision is 93.60% and recall is 75.48%. The method within POI oriented event extraction is effective and has been applied in industrial POI collection.
Keywords :
Internet; data handling; feature extraction; geographic information systems; information analysis; Internet news; POI data extraction; POI heuristic features; POI manual gathering; Point of Interest database; event extraction; event scenario extraction; geographic information system application; lexical analysis news; open testing; structured PoI data extraction; Data mining; Feature extraction; Geographic Information Systems; Natural languages; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Universal Communication Symposium (IUCS), 2010 4th International
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7821-7
Type :
conf
DOI :
10.1109/IUCS.2010.5666648
Filename :
5666648
Link To Document :
بازگشت