Reply to "Emergent LLM behaviors are observationally equivalent to data leakage"