A Supplementary Material A.1 Proof Theorem 1. Assuming a family

Neural Information Processing Systems 

The classifier for density ratio estimation is based on deep neural networks. Our algorithm and baselines are based on the identical outcome prediction network architecture. The results are reported in Table 4. A.5 Results of synthetic experiments under misspecification of the dimension of latent space The dimension of latent factors is hardly known in many scenarios.