GoalConditionedReinforcementLearningforPhoto FinishingTuning
–Neural Information Processing Systems
Previousworkseitheruse zeroth-order optimization, which is slow when the set of parameters increases, or rely on a differentiable proxy of the target finishing pipeline, which is hard to train.
Neural Information Processing Systems
Feb-13-2026, 07:02:05 GMT
- Technology: