Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
–Neural Information Processing Systems
Optimization of parameterized policies for reinforcement learning (RL) is an important and challenging problem in artificial intelligence.
Neural Information Processing Systems
Feb-15-2026, 00:24:32 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- Spain > Canary Islands (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada
- Alberta > Census Division No. 11
- Edmonton Metropolitan Region > Edmonton (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 11
- United States
- California > San Francisco County
- San Francisco (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Texas (0.04)
- California > San Francisco County
- Canada
- Asia > Middle East
- Genre:
- Overview (0.46)
- Industry:
- Leisure & Entertainment > Games > Computer Games (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Games (0.94)
- Machine Learning
- Neural Networks > Deep Learning (0.46)
- Reinforcement Learning (1.00)
- Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence