Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization

Open in new window