What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization
Omar Bennouna, Amine Bennouna, Saurabh Amin, Asuman Ozdaglar
arXiv.org Artificial Intelligence
We study the fundamental question of how informative a dataset is for solving a given decision-making task. In our setting, the dataset provides partial information about unknown parameters that influence task outcomes. Focusing on linear programs, we characterize when a dataset is sufficient to recover an optimal decision, given an uncertainty set on the cost vector. Our main contribution is a sharp geometric characterization that identifies the directions of the cost vector that matter for optimality, relative to the task constraints and uncertainty set. We further develop a practical algorithm that, for a given task, constructs a minimal or least-costly sufficient dataset. Our results reveal that small, well-chosen datasets can often fully determine optimal decisions, offering a principled foundation for task-aware data selection.
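The notion of a sufficient dataset can be illustrated with a toy sketch. This is not the paper's algorithm or its geometric characterization: it assumes a finite list of candidate cost vectors standing in for the uncertainty set, an explicitly listed set of vertices standing in for the linear program's feasible region, and a dataset made of exact linear measurements of the cost vector. The names `is_sufficient`, `queries`, and `candidates` are hypothetical and chosen for illustration only.

```python
# Toy illustration (assumptions: finite uncertainty set, explicit vertex list,
# noiseless linear measurements q . c of the unknown cost vector c).

Vector = tuple[int, ...]


def dot(a: Vector, b: Vector) -> int:
    """Inner product of two integer vectors."""
    return sum(x * y for x, y in zip(a, b))


def optimal_vertices(cost: Vector, vertices: list[Vector]) -> set[Vector]:
    """Vertices minimizing cost . x; a linear program attains its optimum at a vertex."""
    best = min(dot(cost, v) for v in vertices)
    return {v for v in vertices if dot(cost, v) == best}


def is_sufficient(queries: list[Vector], true_cost: Vector,
                  candidates: list[Vector], vertices: list[Vector]) -> bool:
    """True if every candidate cost consistent with the measurements
    shares at least one optimal vertex, so the data pins down a decision."""
    observed = [dot(q, true_cost) for q in queries]
    consistent = [c for c in candidates
                  if [dot(q, c) for q in queries] == observed]
    shared = set(vertices)
    for c in consistent:
        shared &= optimal_vertices(c, vertices)
    return bool(shared)


# Feasible region: the segment between (1, 0) and (0, 1).
vertices = [(1, 0), (0, 1)]
# Hypothetical uncertainty set; the true cost is one of its members.
candidates = [(1, 2), (2, 1), (3, 4), (0, 3)]
true_cost = (1, 2)

# Measuring c1 - c2 determines which vertex is cheaper.
print(is_sufficient([(1, -1)], true_cost, candidates, vertices))
# Measuring c1 + c2 leaves the optimal vertex ambiguous.
print(is_sufficient([(1, 1)], true_cost, candidates, vertices))
```

In this toy instance only the direction (1, -1) of the cost vector affects which vertex is optimal, so a single measurement along it is enough, while a measurement along (1, 1) is not.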
May-29-2025
- Country:
  - Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
  - North America > United States
    - California > Los Angeles County > Santa Monica (0.04)
    - Massachusetts > Middlesex County > Cambridge (0.04)
    - New York > New York County > New York City (0.14)
- Genre:
  - Research Report > New Finding (0.34)
- Technology: