Text-Aware Diffusion for Policy Learning

Feb-13-2026, 06:42:00 GMT–Neural Information Processing Systems

Training an agent to achieve particular goals or perform desired behaviors is often accomplished through reinforcement learning, especially in the absence of expert demonstrations. However, supporting novel goals or behaviors through reinforcement learning requires the ad-hoc design of appropriate reward functions, which quickly becomes intractable. To address this challenge, we propose Text-Aware Diffusion for Policy Learning (TADPoLe), which uses a pretrained, frozen text-conditioned diffusion model to compute dense zero-shot reward signals for text-aligned policy learning.
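
The abstract does not give the reward formula, so the following is only a minimal sketch of the general idea, under stated assumptions: a rendered frame is noised, a frozen noise predictor is queried with and without the text prompt, and the reward is how much the text conditioning reduces the noise-prediction error. The names `toy_denoiser` and `text_aligned_reward`, the variance-preserving noising step, and the use of flat Python lists in place of image tensors are all hypothetical choices for illustration, not the authors' implementation.

```python
import math
import random


def toy_denoiser(noisy, text_embedding=None):
    """Hypothetical stand-in for a frozen, text-conditioned noise predictor.

    A real system would call a pretrained diffusion model here; this toy
    version only exists so the sketch runs end to end.
    """
    pred = [0.5 * x for x in noisy]
    if text_embedding is not None:
        # Assumed conditioning effect: shift the prediction by the embedding.
        pred = [p + 0.1 * t for p, t in zip(pred, text_embedding)]
    return pred


def text_aligned_reward(frame, text_embedding, denoiser, noise_level=0.5, seed=None):
    """Assumed reward: unconditional error minus text-conditioned error.

    The value is positive when conditioning on the text helps the frozen
    model explain the noise that was injected into the frame.
    """
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in frame]

    # Noise the frame with a simple variance-preserving mix (an assumption).
    keep = math.sqrt(1.0 - noise_level)
    mix = math.sqrt(noise_level)
    noisy = [keep * x + mix * n for x, n in zip(frame, noise)]

    # The denoiser is only queried, never updated: it stays frozen.
    eps_cond = denoiser(noisy, text_embedding)
    eps_uncond = denoiser(noisy, None)

    def mse(pred):
        return sum((p - n) ** 2 for p, n in zip(pred, noise)) / len(noise)

    return mse(eps_uncond) - mse(eps_cond)


# Example call with made-up values for a four-"pixel" frame.
frame = [0.1, -0.2, 0.3, 0.4]
prompt_embedding = [1.0, 0.0, -1.0, 0.5]
reward = text_aligned_reward(frame, prompt_embedding, toy_denoiser, seed=0)
```

Because the reward is computed per frame from the frozen model alone, it is dense (available at every timestep) and needs no task-specific reward engineering, which is the property the abstract emphasizes. A denoiser that ignores the text yields a reward of exactly zero under this sketch.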

large language model, machine learning, reinforcement learning

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.48)
  - Machine Learning
    - Reinforcement Learning (0.68)
    - Neural Networks (0.46)
