Scaling Laws in Linear Regression: Compute, Parameters, and Data

Neural Information Processing Systems

From the perspective of statistical learning theory, (1) is rather intriguing. Moreover, they do not provide instance-wise matching lower bounds to verify the tightness of the upper bounds.



Training Code Language Models with Comprehensive Semantics Reasoning

Neural Information Processing Systems

Code Large Language Models (Code LLMs) have excelled at tasks like code completion but often miss deeper semantics such as execution effects and dynamic states. This paper aims to bridge the gap between Code LLMs' reliance on static text data