Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture