
Neural Information Processing Systems 

Here, we focus on a more practical setting of object rearrangement: rearranging objects from shuffled layouts toward a normative target distribution without explicit goal specification. This setting remains challenging for AI agents, since it is hard to describe the target distribution (goal specification) for reward engineering or to collect expert trajectories as demonstrations; it is therefore infeasible to directly apply reinforcement learning or imitation learning to the task. This paper aims to learn a policy from only a set of examples drawn from the target distribution, instead of a handcrafted reward function. We employ a score-matching objective to train a Target Gradient Field (TarGF), which indicates, for each object, a direction that increases the likelihood under the target distribution.
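The core idea can be sketched in a toy 1-D version. This is not the paper's setup (TarGF uses a learned score network over full object states; the Gaussian target, the linear score model, and the single noise level below are simplifying assumptions for illustration), but it shows how denoising score matching recovers a gradient field from examples alone, and how following that field moves a shuffled state toward the target distribution:

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma_d, sigma_n = 2.0, 0.5, 0.3      # target mean/std, DSM noise level (assumed values)

# The only supervision: examples drawn from the target distribution.
x = rng.normal(mu, sigma_d, size=20000)

# Denoising score matching: perturb each example and regress the score
# model onto -noise / sigma_n^2, whose minimizer is the score of the
# noise-smoothed target distribution.
n = rng.normal(0.0, sigma_n, size=20000)
xt = x + n
y = -n / sigma_n**2

# With a linear score model s(x) = a*x + b, the DSM objective is plain
# least squares, so we solve it in closed form for clarity.
A = np.stack([xt, np.ones_like(xt)], axis=1)
(a, b), *_ = np.linalg.lstsq(A, y, rcond=None)

def score(pos):
    """Learned gradient field: points toward higher target likelihood."""
    return a * pos + b

# "Rearrangement": follow the gradient field from a shuffled position.
pos = mu - 1.5
for _ in range(200):
    pos += 0.01 * score(pos)
```

After training, `score` vanishes near the target mean and points toward it elsewhere, so small gradient steps drive the shuffled position to the target layout. In the paper's actual setting the same direction is produced per object and consumed by the downstream policy rather than followed directly.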
