D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models

Open in new window