GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values