A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Nikhil Behari MIT, Harvard University Edwin Zhang

Oct-11-2025, 00:04:38 GMT–Neural Information Processing Systems

RMAB environment, and (3) iterate on the generated reward functions using feedback from grounded RMAB simulations.

category, reflection, reward function, (14 more...)

Neural Information Processing Systems

Oct-11-2025, 00:04:38 GMT

Conferences PDF

Country:
- Europe > Netherlands (0.04)
- Africa > Nigeria (0.04)
- Asia
  - India (0.04)
  - Middle East > Jordan (0.04)

Genre:
- Overview (0.67)
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Health & Medicine
  - Public Health (1.00)
  - Therapeutic Area
    - Infections and Infectious Diseases (0.67)
    - Obstetrics/Gynecology (0.46)
    - Immunology (0.45)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning > Reinforcement Learning (0.93)
    - Representation & Reasoning > Agents (0.93)

Duplicate Docs Excel Report

Title
074f42212be2c8ee651db00f17965ec4-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found