Goal Conditioned Reinforcement Learning for Photo Finishing Tuning

Neural Information Processing Systems 

Previous works either use zeroth-order optimization, which becomes slow as the number of parameters grows, or rely on a differentiable proxy of the target finishing pipeline, which is hard to train.
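To make the scaling issue concrete, here is a minimal sketch of one common zeroth-order scheme, a forward-difference gradient estimate, applied to a toy black-box pipeline. Everything in it is an illustrative assumption rather than anything from the paper: `toy_pipeline`, `loss`, `finite_difference_step`, and `evaluations_per_step` are hypothetical names, and the per-pixel gain model is only a stand-in for a real finishing pipeline. The point it shows is that each update needs one pipeline evaluation per parameter plus one baseline evaluation.

```python
# Hypothetical stand-in for a black-box photo finishing pipeline:
# each parameter is a gain applied to one pixel value.
def toy_pipeline(pixels, params):
    return [p * g for p, g in zip(pixels, params)]


def loss(params, pixels, target, counter):
    """Squared error to the target; counter[0] tracks pipeline evaluations."""
    counter[0] += 1
    out = toy_pipeline(pixels, params)
    return sum((o - t) ** 2 for o, t in zip(out, target))


def finite_difference_step(params, pixels, target, counter, eps=1e-4, lr=0.1):
    """One zeroth-order update using a forward-difference gradient estimate."""
    base = loss(params, pixels, target, counter)
    grad = []
    for i in range(len(params)):
        bumped = list(params)
        bumped[i] += eps  # perturb a single parameter
        grad.append((loss(bumped, pixels, target, counter) - base) / eps)
    return [p - lr * g for p, g in zip(params, grad)]


def evaluations_per_step(n_params):
    """Count black-box evaluations needed for a single update."""
    pixels = [1.0] * n_params
    target = [2.0] * n_params
    counter = [0]
    finite_difference_step([1.0] * n_params, pixels, target, counter)
    return counter[0]
```

Under these assumptions, `evaluations_per_step(4)` returns 5 and `evaluations_per_step(64)` returns 65, so the cost of a single update grows linearly with the number of tuned parameters.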
