Title :
A Comparison of Multiple Instance and Group Based Learning
Author :
Brossi, S.D. ; Bradley, Andrew P.
Author_Institution :
Sch. of Inf. Technol. & Electr. Eng., Univ. of Queensland, St. Lucia, QLD, Australia
Abstract :
In this paper we compare the performance of a number of multiple-instance learning (MIL) and group based (GB) classification algorithms on both a synthetic and real-world Pap smear dataset. We utilise the synthetic dataset to demonstrate that performance improves as both bag size and percent positives increase and that MIL outperforms GB algorithms when the percentage positives is less than 50%. However, as the positive bags become increasingly homogeneous, as is apparent on the real-world dataset, the two approaches become comparable. This result highlights that the performance of a MIL or GB algorithm will be maximised when the algorithm´s MIL assumption matches the reality of the dataset. Therefore, on the Pap smear dataset, algorithms with a more generalised MIL assumption demonstrate the strongest performance.
Keywords :
learning (artificial intelligence); pattern classification; performance evaluation; GB classification algorithm; MIL; bag size; group-based classification algorithms; group-based learning; multiple-instance learning; percentage positives; performance improvement; real-world Pap smear dataset; synthetic Pap smear dataset; Accuracy; Drugs; Pathology; Standards; Supervised learning; Training; Vectors;
Conference_Titel :
Digital Image Computing Techniques and Applications (DICTA), 2012 International Conference on
Conference_Location :
Fremantle, WA
Print_ISBN :
978-1-4673-2180-8
Electronic_ISBN :
978-1-4673-2179-2
DOI :
10.1109/DICTA.2012.6411737