Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Neural Information Processing Systems
Optimization of parameterized policies for reinforcement learning (RL) is an important and challenging problem in artificial intelligence.
Nov-21-2025, 03:56:43 GMT
- Country:
  - North America
    - United States
      - Texas (0.04)
      - Massachusetts > Middlesex County
        - Cambridge (0.04)
      - California > San Francisco County
        - San Francisco (0.14)
    - Canada
      - Quebec > Montreal (0.04)
      - Alberta > Census Division No. 11
        - Edmonton Metropolitan Region > Edmonton (0.04)
  - Europe
    - Spain > Canary Islands (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
  - Asia > Middle East
    - Jordan (0.04)
- Genre:
  - Overview (0.46)
- Industry:
  - Leisure & Entertainment > Games > Computer Games (1.00)
- Technology:
  - Information Technology > Artificial Intelligence
    - Representation & Reasoning > Agents (1.00)
    - Games (0.94)
    - Machine Learning
      - Reinforcement Learning (1.00)
      - Neural Networks > Deep Learning (0.46)