A Vector Symbolic Approach to Multiple Instance Learning
Ehsan Ahmed Dhrubo, Mohammad Mahmudul Alam, Edward Raff, Tim Oates, James Holt
arXiv.org Artificial Intelligence
Multiple Instance Learning (MIL) tasks impose a strict logical constraint: a bag is labeled positive if and only if (iff) at least one instance within it is positive. While this iff constraint aligns with many real-world applications, recent work has shown that most deep learning-based MIL approaches violate it, leading to inflated performance metrics and poor generalization. We propose a novel MIL framework based on Vector Symbolic Architectures (VSAs), which provide a differentiable mechanism for performing symbolic operations in high-dimensional space. Our method encodes the MIL assumption directly into the model's structure by representing instances and concepts as nearly orthogonal high-dimensional vectors and using algebraic operations to enforce the iff constraint during classification. To bridge the gap between raw data and VSA representations, we design a learned encoder that transforms input instances into VSA-compatible vectors while preserving key distributional properties. Our approach, which includes a VSA-driven MaxNetwork classifier, achieves state-of-the-art results among valid MIL models on standard MIL benchmarks and medical imaging datasets, outperforming existing methods while maintaining strict adherence to the MIL formulation. This work offers a principled, interpretable, and effective alternative to existing MIL approaches that rely on learned heuristics.
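The two ideas the abstract leans on, the iff bag-labeling rule and nearly orthogonal high-dimensional vectors, can be illustrated with a minimal sketch. This is not the paper's model: the function names are hypothetical, the VSA shown is a simple bipolar one with element-wise multiplication as the binding operation (an assumption chosen for brevity; the abstract does not say which VSA the authors use), and the max-pooling rule only stands in for the general idea of a classifier that respects the MIL constraint.

```python
import math
import random


def bag_label(instance_labels):
    """MIL assumption: a bag is positive iff at least one instance is positive."""
    return int(any(instance_labels))


def predict_bag(instance_scores, threshold=0.5):
    """Max pooling over instance scores (illustrative stand-in, not the paper's classifier).

    The bag is predicted positive exactly when some instance score
    exceeds the threshold, so the prediction obeys the iff rule.
    """
    return int(max(instance_scores) > threshold)


def random_bipolar(dim, rng):
    """Draw a random vector with entries in {-1, +1} (hypothetical VSA choice)."""
    return [rng.choice((-1, 1)) for _ in range(dim)]


def bind(a, b):
    """Bind two bipolar vectors by element-wise product; binding is its own inverse."""
    return [x * y for x, y in zip(a, b)]


def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    rng = random.Random(0)
    a = random_bipolar(10_000, rng)
    b = random_bipolar(10_000, rng)
    # Independent random vectors in high dimension are close to orthogonal.
    print("cosine(a, b) =", cosine(a, b))
    # Unbinding with the same vector recovers the original.
    print("recovered a:", bind(bind(a, b), b) == a)
    print("bag label:", bag_label([0, 0, 1]), "prediction:", predict_bag([0.1, 0.2, 0.9]))
```

In 10,000 dimensions the cosine similarity of two independent random bipolar vectors has a standard deviation of about 0.01, which is what "nearly orthogonal" means in practice: many distinct concepts can share one space with little interference.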
Nov-24-2025
- Country:
  - Europe
    - Sweden > Stockholm
      - Stockholm (0.04)
    - Switzerland (0.04)
    - United Kingdom > England
      - Oxfordshire > Oxford (0.04)
  - North America > United States
    - California > San Francisco County
      - San Francisco (0.14)
    - Maryland
      - Baltimore (0.04)
      - Baltimore County (0.04)
  - Oceania > New Zealand
    - North Island > Waikato (0.04)
- Genre:
  - Research Report (1.00)
- Industry:
  - Health & Medicine
    - Diagnostic Medicine > Imaging (0.88)
    - Therapeutic Area > Oncology (1.00)
- Technology:
  - Information Technology > Artificial Intelligence
    - Cognitive Science (1.00)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Statistical Learning (1.00)
    - Natural Language (1.00)
    - Representation & Reasoning (1.00)
    - Vision (1.00)