Can LLMs Implicitly Learn Numeric Parameter Constraints in Data Science APIs?
–Neural Information Processing Systems
Data science (DS) programs, typically built on popular DS libraries (such as PyTorch and NumPy) with thousands of APIs, serve as the cornerstone for various mission-critical domains such as financial systems, autonomous driving software, and coding assistants. Recently, large language models (LLMs) have been widely applied to generate DS programs across diverse scenarios, such as assisting users with DS programming or detecting critical vulnerabilities in DS frameworks. Such applications have all operated under the assumption that LLMs can implicitly model the numerical parameter constraints in DS library APIs and produce valid code. However, this assumption has not been rigorously studied in the literature. In this paper, we empirically investigate the proficiency of LLMs in handling these implicit numerical constraints when generating DS programs.
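To make the notion of an implicit numeric parameter constraint concrete, here is a minimal, hypothetical illustration (not taken from the paper) using NumPy's `reshape`: the product of the requested dimensions must equal the array's element count, a constraint that is nowhere in the function signature and is only enforced at runtime.

```python
import numpy as np

# Implicit constraint: the new shape's element count must equal
# the array's element count (3 * 4 == 12).
a = np.arange(12)

valid = a.reshape(3, 4)  # satisfies the constraint
print(valid.shape)       # (3, 4)

try:
    a.reshape(5, 3)      # 5 * 3 == 15 != 12, violates the constraint
except ValueError as err:
    print("constraint violated:", err)
```

Generated code that violates such a constraint type-checks and parses cleanly, yet fails at runtime, which is why the paper asks whether LLMs model these constraints implicitly.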
May-29-2025, 18:24:07 GMT