High-Power Training Data Identification with Provable Statistical Guarantees