An Information-Theoretic Evaluation of Generative Models in Learning Multi-modal Distributions