ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning

Jun-20-2026, 07:41:35 GMT–Neural Information Processing Systems

We propose ReinFlow, a simple yet effective online reinforcement learning (RL) framework that fine-tunes a family of flow matching policies for continuous robotic control. Derived from rigorous RL theory, ReinFlow injects learnable noise into a flow policy's deterministic path, converting the flow into a discrete-time Markov Process for exact and straightforward likelihood computation. This conversion facilitates exploration and ensures training stability, enabling ReinFlow to fine-tune diverse flow model variants stably, including Rectified Flow [34] and Shortcut Models [18], particularly at very few or even one denoising step.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Jun-20-2026, 07:41:35 GMT

Conferences PDF

Add feedback

Country:
- Asia (0.67)
- North America > United States (0.67)

Genre:
- Instructional Material (1.00)
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Education (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.88)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found