DocumentCode :
3076782
Title :
Data Fusion: Boosting Performance in Keyword Extraction
Author :
Bohne, Thomas ; Borghoff, Uwe M.
Author_Institution :
Comput. Sci. Dept., Univ. der Bundeswehr Munchen, Munich, Germany
fYear :
2013
fDate :
22-24 April 2013
Firstpage :
166
Lastpage :
173
Abstract :
In a time when volatile data is in constant growth, the importance of keyword extraction becomes particularly evident. Keywords can quickly identify, structure and reveal potentially worthwhile information. The quality of automatically extracted keywords reflects the individual characteristics of the various retrieval approaches that may be used for extraction. A combinatorial approach using multiple heuristic keyword extraction algorithms may enhance the quality of the results significantly, though it may also compound the inherent limitations. In our paper we compare different ranking aggregation and data fusion methods for single documents. Furthermore we apply principal component analysis to determine an optimal selection of retrieval algorithms for combination with respect to the use-case. To validate our approach, we provide a statistical evaluation with real-world examples.
Keywords :
document handling; information retrieval; principal component analysis; sensor fusion; automatic keyword extraction; boosting performance; data fusion; multiple heuristic keyword extraction algorithms; principal component analysis; ranking aggregation methods; real-world examples; retrieval algorithms; single documents; statistical evaluation; Algorithm design and analysis; Data mining; Eigenvalues and eigenfunctions; Frequency measurement; Heuristic algorithms; Mathematical model; Principal component analysis; algorithm combination; extraction; principal component analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering of Computer Based Systems (ECBS), 2013 20th IEEE International Conference and Workshops on the
Conference_Location :
Scottsdale, AZ
Print_ISBN :
978-0-7695-4991-0
Type :
conf
DOI :
10.1109/ECBS.2013.12
Filename :
6601585
Link To Document :
بازگشت