Off-Policy Evaluation for Action-Dependent Non-Stationary Environments (Appendix)