A million-scale dataset and generalizable foundation model for nanomaterial-protein interactions