Promises and Pitfalls of Threshold-based Auto-labeling