DocumentCode
3076782
Title
Data Fusion: Boosting Performance in Keyword Extraction
Author
Bohne, Thomas ; Borghoff, Uwe M.
Author_Institution
Comput. Sci. Dept., Univ. der Bundeswehr Munchen, Munich, Germany
fYear
2013
fDate
22-24 April 2013
Firstpage
166
Lastpage
173
Abstract
In a time when volatile data is in constant growth, the importance of keyword extraction becomes particularly evident. Keywords can quickly identify, structure and reveal potentially worthwhile information. The quality of automatically extracted keywords reflects the individual characteristics of the various retrieval approaches that may be used for extraction. A combinatorial approach using multiple heuristic keyword extraction algorithms may enhance the quality of the results significantly, though it may also compound the inherent limitations. In our paper we compare different ranking aggregation and data fusion methods for single documents. Furthermore we apply principal component analysis to determine an optimal selection of retrieval algorithms for combination with respect to the use-case. To validate our approach, we provide a statistical evaluation with real-world examples.
Keywords
document handling; information retrieval; principal component analysis; sensor fusion; automatic keyword extraction; boosting performance; data fusion; multiple heuristic keyword extraction algorithms; principal component analysis; ranking aggregation methods; real-world examples; retrieval algorithms; single documents; statistical evaluation; Algorithm design and analysis; Data mining; Eigenvalues and eigenfunctions; Frequency measurement; Heuristic algorithms; Mathematical model; Principal component analysis; algorithm combination; extraction; principal component analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering of Computer Based Systems (ECBS), 2013 20th IEEE International Conference and Workshops on the
Conference_Location
Scottsdale, AZ
Print_ISBN
978-0-7695-4991-0
Type
conf
DOI
10.1109/ECBS.2013.12
Filename
6601585
Link To Document