Local Optimization for Simulation of Natural Motion
Erez, Tom (Washington University in St. Louis)
I intend to use RL to bring the two together, The Reinforcement Learning (RL) agent interacts with a dynamical and generate motion from the proposed first principles system whose states capture all the relevant information in realistic biomechanical models, and compare the about the current configuration of the agent and its results to the behavior of living creatures. This is a nontrivial environment. By specifying a sequence of actions, the agent problem: biomechanical models are continuous, highdimensional alters the state transitions of this dynamical system. The optimality and nonlinear, and the optimality criteria considered criterion is formalized by a reward function defined in the literature are non-quadratic. In order to address over state-action pairs, and the agent's goal is to maximize these profound challenges, I propose three basic principles the cumulative reward.
Jul-12-2010
- Country:
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.05)
- Asia > Japan
- Honshū > Chūbu > Nagano Prefecture > Nagano (0.05)
- North America > United States
- Industry:
- Health & Medicine (0.32)
- Technology: