Emmanuel Abbe
Neural Information Processing Systems
Can Transformers predict new syllogisms by composing established ones? More generally, what types of targets can such models learn from scratch? Recent work shows that Transformers can be Turing-complete in terms of expressivity, but expressivity alone does not settle learnability. This paper puts forward the notion of the globality degree of a target distribution to capture when weak learning is efficiently achievable by regular Transformers.
May-29-2025, 00:18:25 GMT