Author/Authors :
Xinhai Liu1، نويسنده , , 2، نويسنده , ,
Shi Yu1، نويسنده , ,
Frizo Janssens1، نويسنده , ,
Wolfgang Gl?nzel3، نويسنده , , 4، نويسنده , ,
Yves Moreau1، نويسنده , ,
Bart De Moor، نويسنده ,
Abstract :
We propose a new hybrid clustering framework to incorporate text mining with bibliometrics in journal set analysis. The framework integrates two different approaches: clustering ensemble and kernel-fusion clustering. To improve the flexibility and the efficiency of processing large-scale data, we propose an information-based weighting scheme to leverage the effect of multiple data sources in hybrid clustering. Three different algorithms are extended by the proposed weighting scheme and they are employed on a large journal set retrieved from the Web of Science (WoS) database. The clustering performance of the proposed algorithms is systematically evaluated using multiple evaluation methods, and they were cross-compared with alternative methods. Experimental results demonstrate that the proposed weighted hybrid clustering strategy is superior to other methods in clustering performance and efficiency. The proposed approach also provides a more refined structural mapping of journal sets, which is useful for monitoring and detecting new trends in different scientific fields.