Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit Tests

May-28-2025, 18:12:17 GMT–Neural Information Processing Systems

Multiple Instance Learning (MIL) is a sub-domain of classification in which each label, positive or negative, is attached to a "bag" of inputs: the label is positive if and only if the bag contains at least one positive element, and negative otherwise. Training in this context requires associating the bag-wide label with instance-level information, and the task implicitly contains a causal assumption and an asymmetry (i.e., you can't swap the labels without changing the semantics). MIL problems occur in healthcare (one malignant cell indicates cancer), cybersecurity (one malicious executable makes an infected computer), and many other tasks. In this work, we examine five of the most prominent deep-MIL models and find that none of them respects the standard MIL assumption. They are able to learn anticorrelated instances, i.e., defaulting to "positive" labels until seeing a negative counter-example, which should not be possible for a correct MIL model.
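The standard MIL assumption described above can be illustrated with a minimal Python sketch. This is not the paper's test suite: the function names (`mil_bag_label`, `max_pool_score`, `mean_pool_score`, `respects_mil_monotonicity`) and the monotonicity check are assumptions introduced here for illustration. The idea is that, under the standard assumption, adding an instance to a bag should never lower the bag's positive score, so a scorer that can be pushed from "positive" toward "negative" by an extra instance fails the check.

```python
from typing import Callable, Sequence


def mil_bag_label(instance_labels: Sequence[int]) -> int:
    """Standard MIL assumption: a bag is positive (1) if and only if
    at least one of its instances is positive; otherwise it is negative (0)."""
    return int(any(label == 1 for label in instance_labels))


def max_pool_score(bag: Sequence[float]) -> float:
    """Hypothetical bag scorer: the highest instance score decides the bag."""
    return max(bag)


def mean_pool_score(bag: Sequence[float]) -> float:
    """Hypothetical bag scorer: the average instance score decides the bag."""
    return sum(bag) / len(bag)


def respects_mil_monotonicity(
    score_fn: Callable[[Sequence[float]], float],
    bags: Sequence[Sequence[float]],
    extra_instances: Sequence[float],
) -> bool:
    """Illustrative unit test (an assumption of this sketch, not the paper's):
    appending any instance to a non-empty bag must never lower the bag score."""
    for bag in bags:
        base = score_fn(bag)
        for instance in extra_instances:
            if score_fn(list(bag) + [instance]) < base:
                return False
    return True


if __name__ == "__main__":
    bags = [[0.9], [0.2, 0.4]]
    extras = [0.1, 0.95]
    print(mil_bag_label([0, 0, 1]))
    print(respects_mil_monotonicity(max_pool_score, bags, extras))
    print(respects_mil_monotonicity(mean_pool_score, bags, extras))
```

In this sketch, max pooling passes the check because an extra instance can only raise the maximum, while mean pooling fails because a low-scoring instance drags the bag score down. That failure is a toy analogue of the behavior the abstract describes, where a model defaults to "positive" until it sees a negative counter-example.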

artificial intelligence, assumption, machine learning

Country:
- North America > United States > Maryland (0.28)

Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning
      - Neural Networks (0.47)
      - Statistical Learning > Support Vector Machines (0.46)
    - Representation & Reasoning (1.00)
  - Security & Privacy (1.00)