Limits of Transformer Language Models on Learning to Compose Algorithms

Jonathan Thomm

Neural Information Processing Systems 

We analyze the capabilities of Transformer language models in learning compositional discrete tasks. To this end, we evaluate training LLaMA models and prompting GPT-4 and Gemini on four tasks that require learning a composition of several discrete sub-tasks. In particular, we measure how well these models can reuse primitives observable in the sub-tasks to learn the composition task.