VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
Awasthi, Raghav, Guliani, Keerat Kaur, Bhatt, Arshita, Gill, Mehrab Singh, Nagori, Aditya, Kumaraguru, Ponnurangam, Sethi, Tavpritesh
–arXiv.org Artificial Intelligence
A COVID-19 vaccine is our best bet for mitigating the ongoing onslaught of the pandemic. However, vaccine is also expected to be a limited resource. An optimal allocation strategy, especially in countries with access inequities and a temporal separation of hot-spots might be an effective way of halting the disease spread. We approach this problem by proposing a novel pipeline VacSIM that dovetails Actor-Critic using Kronecker-Factored Trust Region (ACKTR) model into a Contextual Bandits approach for optimizing the distribution of COVID-19 vaccine. Whereas the ACKTR model suggests better actions and rewards, Contextual Bandits allow online modifications that may need to be implemented on a day-to-day basis in the real world scenario. We evaluate this framework against a naive allocation approach of distributing vaccine proportional to the incidence of COVID-19 cases in five different States across India and demonstrate up to 100,000 additional lives potentially saved and a five-fold increase in the efficacy of limiting the spread over a period of 30 days through the VacSIM approach. We also propose novel evaluation strategies including a standard compartmental model based projections and a causality preserving evaluation of our model. Finally, we contribute a new Open-AI environment meant for the vaccine distribution scenario, and open-source VacSIM for wide testing and applications across the globe.
arXiv.org Artificial Intelligence
Sep-14-2020
- Country:
- Asia
- India
- Jharkhand (0.05)
- Maharashtra (0.05)
- NCT > New Delhi (0.04)
- Nagaland (0.05)
- Uttarakhand > Roorkee (0.04)
- Middle East > Jordan (0.04)
- India
- Europe > United Kingdom (0.04)
- North America > United States
- New York (0.04)
- Asia
- Genre:
- Research Report (0.85)
- Industry: