Bandit Data-driven Optimization: AI for Social Good and Beyond
Shi, Zheyuan Ryan, Wu, Zhiwei Steven, Ghani, Rayid, Fang, Fei
–arXiv.org Artificial Intelligence
The use of machine learning (ML) systems in real-world applications entails more than just a prediction algorithm. AI for social good applications, and many real-world ML tasks in general, feature an iterative process which joins prediction, optimization, and data acquisition happen in a loop. We introduce bandit data-driven optimization, the first iterative prediction-prescription framework to formally analyze this practical routine. Bandit data-driven optimization combines the advantages of online bandit learning and offline predictive analytics in an integrated framework. It offers a flexible setup to reason about unmodeled policy objectives and unforeseen consequences. We propose PROOF, the first algorithm for this framework and show that it achieves no-regret. Using numerical simulations, we show that PROOF achieves superior performance over existing baseline.
arXiv.org Artificial Intelligence
Aug-26-2020
- Country:
- North America > United States (0.28)
- Genre:
- Instructional Material (0.34)
- Research Report (0.40)
- Industry:
- Social Sector (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks (0.46)
- Representation & Reasoning (0.95)
- Data Science > Data Mining (1.00)
- Game Theory (0.93)
- Artificial Intelligence
- Information Technology