Simplicity bias and optimization threshold in two-layer ReLU networks

Open in new window