Lifelong Policy Gradient Learning of Factored Policies for Faster Training Without Forgetting
Jorge A. Mendez
Neural Information Processing Systems
Policy gradient methods have shown success in learning control policies for high-dimensional dynamical systems. Their biggest downside is the amount of exploration they require before yielding high-performing policies.
Nov-14-2025, 22:34:34 GMT
- Country:
  - Asia > Middle East
    - Jordan (0.04)
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.04)
  - North America
    - Canada > Ontario (0.04)
    - United States > Pennsylvania (0.04)
- Industry:
  - Education > Educational Setting (0.47)
  - Government (0.68)
- Technology:
  - Information Technology > Artificial Intelligence
    - Machine Learning > Neural Networks (1.00)
    - Representation & Reasoning (0.68)
    - Robots (0.93)