Data Readiness for Scientific AI at Scale