• DocumentCode
    41852
  • Title

    FINNIM: Iterative Imputation of Missing Values in Dissolved Gas Analysis Dataset

  • Author

    Sahri, Zahriah ; Yusof, Rubiyah ; Watada, Junzo

  • Author_Institution
    Univ. Teknikal Malaysia Melaka, Durian Tunggal, Malaysia
  • Volume
    10
  • Issue
    4
  • fYear
    2014
  • fDate
    Nov. 2014
  • Firstpage
    2093
  • Lastpage
    2102
  • Abstract
    Missing values are a common occurrence in a number of real world databases, and statistical methods have been developed to deal with this problem, referred to as missing data imputation. In the detection and prediction of incipient faults in power transformers using dissolved gas analysis (DGA), the problem of missing values is significant and has resulted in inconclusive decision-making. This study proposes an efficient nonparametric iterative imputation method named FINNIM, which comprises of three components: 1) the imputation ordering; 2) the imputation estimator; and 3) the iterative imputation. The relationship between gases and faults, and the percentage of missing values in an instance are used as a basis for the imputation ordering; whereas the plausible values for the missing values are estimated from k-nearest neighbor instances in the imputation estimator, and the iterative imputation allows complete and incomplete instances in a DGA dataset to be utilized iteratively for imputing all the missing values. Experimental results on both artificially inserted and actual missing values found in a few DGA datasets demonstrate that the proposed method outperforms the existing methods in imputation accuracy, classification performance, and convergence criteria at different missing percentages.
  • Keywords
    decision making; iterative methods; pattern classification; power engineering computing; power transformers; DGA dataset; FINNIM; classification performance; databases; dissolved gas analysis dataset; imputation estimator; imputation ordering; incipient fault detection; incipient fault prediction; inconclusive decision-making; k-nearest neighbor instances; missing data imputation; missing values; nonparametric iterative imputation method; power transformers; statistical methods; Accuracy; Convergence; Fault detection; Gases; Iterative methods; Power transformers; $bm{k}$ -nearest neighbor ( $bm{k}$ NN); Dissolved gas analysis (DGA); imputation ordering; iterative imputation; missing data imputation; missing values;
  • fLanguage
    English
  • Journal_Title
    Industrial Informatics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1551-3203
  • Type

    jour

  • DOI
    10.1109/TII.2014.2350837
  • Filename
    6882199