latent-ode
Country:
- North America > United States > North Carolina (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Country:
- North America > United States > California > Alameda County > Berkeley (0.14)
- North America > United States > North Carolina (0.04)
- North America > United States > New York (0.04)
- (2 more...)
Industry:
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.73)
- Health & Medicine > Therapeutic Area > Immunology > HIV (0.48)
Technology:
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Du, Jianzhun, Futoma, Joseph, Doshi-Velez, Finale
We present two elegant solutions for modeling continuous-time dynamics, in a novel model-based reinforcement learning (RL) framework for semi-Markov decision processes (SMDPs), using neural ordinary differential equations (ODEs). Our models accurately characterize continuous-time dynamics and enable us to develop high-performing policies using a small amount of data. We also develop a model-based approach for optimizing time schedules to reduce interaction rates with the environment while maintaining the near-optimal performance, which is not possible for model-free methods. We experimentally demonstrate the efficacy of our methods across various continuous-time domains.
2006.1621
Country:
- North America > United States > California > Alameda County > Berkeley (0.14)
- North America > United States > North Carolina (0.04)
- North America > United States > New York (0.04)
- (2 more...)
Industry:
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology > HIV (0.48)
Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.70)