Learning Consistent Deep Generative Models from Sparse Data via Prediction Constraints