• DocumentCode
    145268
  • Title

    On Machine Learning with Imbalanced Data and Research Quality Evaluation Methodologies

  • Author

    Lipitakis, Anastasia-Dimitra ; Lipitakis, Evanglia A. E. C.

  • Volume
    1
  • fYear
    2014
  • fDate
    10-13 March 2014
  • Firstpage
    451
  • Lastpage
    457
  • Abstract
    In this article a synoptic review of machine learning techniques with imbalanced data and a class of corresponding learning algorithms is presented. This class of algorithms includes the meta-algorithms: Cost sensitive, Metacost, Rotation forest-cost sensitive, rotation forest-smote. Four learning algorithms (with base classifiers J48 and part processing with F-measure and a predetermined imbalanced data set) are compared in the computational environment WEKA leading to comparative numerical results. The basic concepts of research quality evaluation methodologies are presented, an adaptive citation qualitative-quantitative approach and advanced bibliometric indicators are given. Basic components of research quality performance such as research journal cited publications, citing publications and research quality evaluations at various academic levels are considered and corresponding numerical results are given. An alternative approach using certain machine learning algorithms with imbalanced data in the case of research quality evaluation methodologies is proposed.
  • Keywords
    information analysis; learning (artificial intelligence); pattern classification; F-measure; J48 classifier; WEKA environment; adaptive citation qualitative-quantitative approach; bibliometric indicators; cost sensitive learning algorithm; imbalanced data; machine learning techniques; metacost learning algorithm; part processing; research quality evaluation methodologies; rotation forest-cost sensitive learning algorithm; rotation forest-smote learning algorithm; Algorithm design and analysis; Artificial intelligence; Bibliometrics; Business; Classification algorithms; Educational institutions; Machine learning algorithms; Bibliometric Indicators; Business Intelligence; Citation Analysis; Computational Intelligence; Data Mining; Imbalanced Data; Learning Algorithms; Machine Learning; Quantitative Methods; Research Quality Evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Science and Computational Intelligence (CSCI), 2014 International Conference on
  • Conference_Location
    Las Vegas, NV
  • Type

    conf

  • DOI
    10.1109/CSCI.2014.81
  • Filename
    6822151