Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing

Chen Liang, Mohammad Norouzi, Jonathan Berant, Quoc V. Le, Ni Lao

Nov-20-2025, 21:14:06 GMT–Neural Information Processing Systems

MAPO improves the sample efficiency and robustness of policy gradient, especially on tasks with sparse rewards.

mapo, memory buffer, trajectory, (14 more...)

Neural Information Processing Systems

Nov-20-2025, 21:14:06 GMT

Conferences PDF

Country:
- North America > Canada
  - Quebec > Montreal (0.04)
- Europe
  - Hungary (0.04)
  - Germany (0.04)
  - Finland (0.04)
- Asia
  - Thailand (0.04)
  - China (0.04)
  - Middle East
    - Jordan (0.04)
    - Israel > Tel Aviv District
      - Tel Aviv (0.04)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language > Grammars & Parsing (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.94)

Duplicate Docs Excel Report

Title
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing

Similar Docs Excel Report more

Title	Similarity	Source
None found