On Margin Maximization in Linear and ReLU Networks