• DocumentCode
    730338
  • Title
    Removing data with noisy responses in regression analysis
  • Author
    Wisler, Alan ; Berisha, Visar ; Ramamurthy, Karthikeyan ; Spanias, Andreas ; Liss, Julie
  • Author_Institution
    Dept. of SHS, ASU, Tempe, AZ, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2066
  • Lastpage
    2070
  • Abstract
    In regression analysis, outliers in the data can induce a bias in the learned function, resulting in larger errors. In this paper we derive an empirically estimable bound on the regression error based on a Euclidean minimum spanning tree generated from the data. Using this bound as motivation, we propose an iterative approach to remove data with noisy responses from the training set. We evaluate the performance of the algorithm on experiments with real-world pathological speech (speech from individuals with neurogenic disorders). Comparative results show that removing noisy examples during training using the proposed approach yields better predictive performance on out-of-sample data.
  • Keywords
    iterative methods; regression analysis; speech processing; Euclidean minimum spanning tree; data outliers; iterative approach; neurogenic disorders; noisy responses; real world pathological speech; regression error; Accuracy; Classification algorithms; Training data; Friedman-Rafsky statistic; minimum spanning tree; noisy data; outlier removal; robust regression
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type
    conf
  • DOI
    10.1109/ICASSP.2015.7178334
  • Filename
    7178334