• DocumentCode
    427850
  • Title

    Quantizing features independently in the Bayesian data reduction algorithm

  • Author

    Lynch, Robert S., Jr. ; Willett, Peter K.

  • Author_Institution
    Signal Process. Branch, Naval Undersea Warfare Center, Newport, RI, USA
  • Volume
    2
  • fYear
    2004
  • fDate
    10-13 Oct. 2004
  • Firstpage
    1336
  • Abstract
    In this paper, the Bayesian data reduction algorithm (BDRA) is modified to quantize each feature (i.e., those that are continuous valued and not naturally discrete, or categorical), independently, allowing a best set of thresholds to be found for all dimensions contained in the data. The algorithm works by initially quantizing each feature with at least ten discrete levels (via percentiles), where the BDRA then proceeds to reduce this to a number of levels yielding best performance. The BDRA then trains on all independently quantized features simultaneously, finding the best overall quantization complexity of the data that minimizes the training error. Results are demonstrated illustrating the performance of the new modified algorithm for various data sets found at the University of California at Irvine´s (UCI) repository of machine learning databases.
  • Keywords
    belief networks; data reduction; learning (artificial intelligence); Bayesian data reduction algorithm; machine learning databases; quantization complexity; training error; Bayesian methods; Machine learning; Machine learning algorithms; Quantization; Signal processing algorithms; Spatial databases; Supervised learning; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2004 IEEE International Conference on
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-8566-7
  • Type

    conf

  • DOI
    10.1109/ICSMC.2004.1399811
  • Filename
    1399811