• DocumentCode
    2142188
  • Title

    A VSM-based data mining engine for geoscience documents

  • Author

    Lv, Peng ; Bi, Zhiwei ; Zhu, Pengfei ; Wu, Wen

  • Author_Institution
    Institute of remote sensing applications, China Academy of Sciences, Beijing, China
  • fYear
    2010
  • fDate
    4-6 Dec. 2010
  • Firstpage
    4909
  • Lastpage
    4912
  • Abstract
    With the development of information technology in geosciences, enormous data and documents can not be processed by ordinary methods. Furthermore it is difficult to precisely search the target document quickly. In this paper, we propose the use of vector space model (VSM) for automatic date mining of geosciences documents, and a VSM-based search engine system is designed and implemented, which includes three main components: 1)a word segment structure with two hash tables managing the first and the last words of a geo-item and a Trie tree containing the rest of words; 2) a linear space composited by all related documents which need the calculating of similarity; 3) a vector space module mapping documents to multi-dimensional vector space and comparing keywords with features of documents to decide the similarity. This system can make it convenient in geodata sharing and improves the work process efficiently.
  • Keywords
    Adaptation model; Computational modeling; Data mining; Engines; Geology; Information retrieval; Vectors; VSM; date mining; geoscience documents; trie tree;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Engineering (ICISE), 2010 2nd International Conference on
  • Conference_Location
    Hangzhou, China
  • Print_ISBN
    978-1-4244-7616-9
  • Type

    conf

  • DOI
    10.1109/ICISE.2010.5690957
  • Filename
    5690957