Data Distributional Properties As Inductive Bias for Systematic Generalization