QGFN: Controllable Greediness with Action Values

Lau, Elaine; Lu, Stephen Zhewen; Pan, Ling; Precup, Doina; Bengio, Emmanuel

Feb-7-2024, arXiv.org Artificial Intelligence

Generative Flow Networks (GFlowNets; GFNs) are a family of reward/energy-based generative methods for combinatorial objects, capable of generating diverse and high-utility samples. However, biasing GFNs towards producing high-utility samples is non-trivial. In this work, we leverage connections between GFNs and reinforcement learning (RL) and propose to combine the GFN policy with an action-value estimate, $Q$, to create greedier sampling policies whose greediness can be controlled by a mixing parameter. We show that several variants of the proposed method, QGFN, increase the number of high-reward samples generated across a variety of tasks without sacrificing diversity.
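The abstract does not spell out how the GFN policy and $Q$ are combined, so the following is only a minimal sketch of one plausible mixing rule, not necessarily any of the paper's variants: with probability `p` take the action with the highest $Q$ value, otherwise sample from the GFN policy. The function name `mixed_action`, its arguments, and the rule itself are assumptions made for illustration.

```python
import random


def mixed_action(policy_probs, q_values, p, rng=random):
    """Pick an action index by mixing a sampling policy with a greedy Q rule.

    Hypothetical illustration: `policy_probs` stands in for the GFN policy's
    action probabilities at one state, `q_values` for the action-value
    estimates at that state, and `p` for the mixing parameter.
    """
    if len(policy_probs) != len(q_values):
        raise ValueError("policy_probs and q_values must have the same length")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")

    actions = range(len(q_values))
    if rng.random() < p:
        # Greedy branch: choose the action with the largest Q estimate.
        return max(actions, key=q_values.__getitem__)
    # Exploratory branch: sample an action from the policy distribution.
    return rng.choices(actions, weights=policy_probs, k=1)[0]
```

Under this assumed rule, `p = 0` recovers plain sampling from the policy and `p = 1` is fully greedy with respect to $Q$, so intermediate values trade off diversity against reward.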

controllable greediness, qgfn, variant

Country:
- North America > Canada
  - Quebec (0.04)
- Europe > Austria
  - Vienna (0.04)
- Asia > China
  - Hong Kong (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.93)
  - Machine Learning
    - Neural Networks (0.93)
    - Reinforcement Learning (0.89)
    - Statistical Learning (0.67)