Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization