Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning
–Neural Information Processing Systems
Figure 2: The task is to move the cart to the origin as quickly as possible without dropping the pole. The bottom three pictures show a trace of the policy execution obtained after one, two, and three trials (shown in increments of 0.5 seconds) Controller Number of data points used Cost from initial state 17 to build the controller LQR
Neural Information Processing Systems
Dec-31-1997
- Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- Technology: