Forward and Backward State Abstractions for Off-policy Evaluation