• DocumentCode
    1027151
  • Title

    Kernels for Generalized Multiple-Instance Learning

  • Author

    Tao, Qingping ; Scott, Stephen D. ; Vinodchandran, N.V. ; Osugi, Thomas Takeo ; Mueller, Brandon

  • Author_Institution
    GC Image, LLC, Lincoln, NE
  • Volume
    30
  • Issue
    12
  • fYear
    2008
  • Firstpage
    2084
  • Lastpage
    2098
  • Abstract
    The multiple-instance learning (MIL) model has been successful in numerous application areas. Recently, a generalization of this model and an algorithm for it were introduced, showing significant advantages over the conventional MIL model on certain application areas. Unfortunately, that algorithm is not scalable to high dimensions. We adapt that algorithm to one using a support vector machine with our new kernel kwedge. This reduces the time complexity from exponential in the dimension to polynomial. Computing our new kernel is equivalent to counting the number of boxes in a discrete, bounded space that contain at least one point from each of two multisets. We show that this problem is #P-complete, but then give a fully polynomial randomized approximation scheme (FPRAS) for it. We then extend k by enriching its representation into a new kernel kmin, and also consider a normalized version of kwedge that we call k (which may or may not not be a kernel, but whose approximation yielded positive semidefinite Gram matrices in practice). We then empirically evaluate all three measures on data from content-based image retrieval, biological sequence analysis, and the musk data sets. We found that our kernels performed well on all data sets relative to algorithms in the conventional MIL model.
  • Keywords
    approximation theory; computational complexity; learning (artificial intelligence); support vector machines; #P-complete; FPRAS; MIL; biological sequence analysis; content-based image retrieval; fully polynomial randomized approximation scheme; generalized multiple-instance learning; musk data sets; support vector machine; time complexity; Machine learning; generalized multiple-instance learning; kernels; support vector machines; Algorithms; Artificial Intelligence; Computer Simulation; Models, Theoretical; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2007.70846
  • Filename
    4420086