Simplicity bias and optimization threshold in two-layer ReLU networks