• DocumentCode
    735377
  • Title

    Evaluation on geospatial information extraction and retrieval: Mining thematic maps from web source

  • Author

    Dewandaru, Agung ; Supriana S, Iping ; Akbar, Saiful

  • Author_Institution
    Sch. of Electr. Eng. & Inf., Bandung Inst. of Technol., Bandung, Indonesia
  • fYear
    2015
  • fDate
    27-29 May 2015
  • Firstpage
    283
  • Lastpage
    288
  • Abstract
    The World Wide Web has easily become the largest repository of natural language text data. We are particularly interested in state-of-the-art methods for exploiting geospatial information on the web. The survey is done in the context of extraction methods, retrieval, visualization, and further possible mining or knowledge discovery scenarios, in order to produce thematic maps automatically from the web corpus. We found that Web-based Geographic Information Retrieval (GIR) methods that return selected relevant areas instead of points are still lacking, even though area modeling is common in GIS. We also found that most GIR methods are still focused on places and buildings instead of themes or information around an area. This indicates that state-of-the-art GIR methods are not yet sufficient for thematic extraction and retrieval to generate thematic maps from a web corpus. Bayesian topic models such as Latent Dirichlet Allocation may serve as a good basis for such use cases.
  • Keywords
    Internet; cartography; data mining; geographic information systems; information retrieval; Bayesian topic models; GIR method; Web source; geospatial information exploitation; geospatial information extraction; geospatial information retrieval; knowledge discovery; latent Dirichlet allocation; natural language text data; thematic maps mining; Context; Data mining; Geospatial analysis; Information retrieval; Measurement; Natural languages; Prototypes; geographic information retrieval; information extraction; information retrieval; information visualization; knowledge discovery; thematic extraction; thematic maps; topic modeling; web mining;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information and Communication Technology (ICoICT), 2015 3rd International Conference on
  • Conference_Location
    Nusa Dua
  • Type

    conf

  • DOI
    10.1109/ICoICT.2015.7231437
  • Filename
    7231437