• DocumentCode
    3076782
  • Title

    Data Fusion: Boosting Performance in Keyword Extraction

  • Author

    Bohne, Thomas ; Borghoff, Uwe M.

  • Author_Institution
    Comput. Sci. Dept., Univ. der Bundeswehr Munchen, Munich, Germany
  • fYear
    2013
  • fDate
    22-24 April 2013
  • Firstpage
    166
  • Lastpage
    173
  • Abstract
    In a time when volatile data is in constant growth, the importance of keyword extraction becomes particularly evident. Keywords can quickly identify, structure and reveal potentially worthwhile information. The quality of automatically extracted keywords reflects the individual characteristics of the various retrieval approaches that may be used for extraction. A combinatorial approach using multiple heuristic keyword extraction algorithms may enhance the quality of the results significantly, though it may also compound the inherent limitations. In our paper we compare different ranking aggregation and data fusion methods for single documents. Furthermore we apply principal component analysis to determine an optimal selection of retrieval algorithms for combination with respect to the use-case. To validate our approach, we provide a statistical evaluation with real-world examples.
  • Keywords
    document handling; information retrieval; principal component analysis; sensor fusion; automatic keyword extraction; boosting performance; data fusion; multiple heuristic keyword extraction algorithms; principal component analysis; ranking aggregation methods; real-world examples; retrieval algorithms; single documents; statistical evaluation; Algorithm design and analysis; Data mining; Eigenvalues and eigenfunctions; Frequency measurement; Heuristic algorithms; Mathematical model; Principal component analysis; algorithm combination; extraction; principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering of Computer Based Systems (ECBS), 2013 20th IEEE International Conference and Workshops on the
  • Conference_Location
    Scottsdale, AZ
  • Print_ISBN
    978-0-7695-4991-0
  • Type

    conf

  • DOI
    10.1109/ECBS.2013.12
  • Filename
    6601585