Improving Generative Methods for Causal Evaluation via Simulation-Based Inference