How does a Neural Network's Architecture Impact its Robustness to Noisy Labels?