levine
Country:
- North America > United States > New York > New York County > New York City (0.05)
- Europe > Sweden > Stockholm > Stockholm (0.05)
- Asia > Middle East > Jordan (0.05)
- (6 more...)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.51)
Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > United States > Texas (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- (3 more...)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Country:
- Europe > Austria (0.04)
- North America > United States > Maryland > Baltimore (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (9 more...)
Genre:
- Research Report > Experimental Study (1.00)
- Workflow (0.67)
Technology:
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
However, existing offline RL methods tend to behave poorly during fine-tuning. In this paper, we study the fine-tuning problem in the context of conservative offline RL methods and we devise an approach for learning an effective initialization from offline data that also enables fast online fine-tuning capabilities.
Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > Montana (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > China > Guangdong Province > Shenzhen (0.04)
Technology:
Country:
- Asia > Middle East > Jordan (0.04)
- Asia > China (0.04)
Technology:
Country:
- Asia > China > Beijing > Beijing (0.05)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Asia > Middle East > Jordan (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Country:
- North America > United States > Massachusetts (0.04)
- North America > Canada > Quebec > Montreal (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)