A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Nikhil Behari MIT, Harvard University Edwin Zhang
–Neural Information Processing Systems
RMAB environment, and (3) iterate on the generated reward functions using feedback from grounded RMAB simulations.
Neural Information Processing Systems
Oct-11-2025, 00:04:38 GMT
- Country:
- Africa > Nigeria (0.04)
- Asia
- India (0.04)
- Middle East > Jordan (0.04)
- Europe > Netherlands (0.04)
- South America > Peru
- Loreto Department (0.04)
- Genre:
- Overview (0.67)
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Industry:
- Health & Medicine
- Public Health (1.00)
- Therapeutic Area
- Immunology (0.45)
- Infections and Infectious Diseases (0.67)
- Obstetrics/Gynecology (0.46)
- Health & Medicine
- Technology: