Feature Learning and Generalization in Deep Networks with Orthogonal Weights