A Surprising Linear Relationship Predicts Test Performance in Deep Networks