Learning Action and Reasoning-Centric Image Editing from Videos and Simulation

Mar-19-2026, 23:46:09 GMT–Neural Information Processing Systems

An image editing model should be able to perform diverse edits, ranging from object replacement, changing attributes or style, to performing actions or movement, which require many forms of reasoning. Current instruction-guided editing models have significant shortcomings with action and reasoning-centric edits.Object, attribute or stylistic changes can be learned from visually static datasets. On the other hand, high-quality data for action and reasoning-centric edits is scarce and has to come from entirely different sources that cover e.g.

artificial intelligence, machine learning, proceedings, (10 more...)

Neural Information Processing Systems

Mar-19-2026, 23:46:09 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.38)
  - Machine Learning (0.37)