A polar coordinate system represents syntax in large language models

May-25-2025, 16:18:51 GMT–Neural Information Processing Systems

Originally formalized with symbolic representations, syntactic trees may also be effectively represented in the activations of large language models (LLMs). Indeed, a "Structural Probe" can find a subspace of neural activations, where syntacticallyrelated words are relatively close to one-another. However, this syntactic code remains incomplete: the distance between the Structural Probe word embeddings can represent the existence but not the type and direction of syntactic relations. Here, we hypothesize that syntactic relations are, in fact, coded by the relative direction between nearby embeddings. To test this hypothesis, we introduce a "Polar Probe" trained to read syntactic relations from both the distance and the direction between word embeddings.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

May-25-2025, 16:18:51 GMT

Conferences PDF

Add feedback

Country:
- Asia > Middle East
  - UAE (0.14)
- Europe > France (0.14)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.94)
  - Natural Language > Large Language Model (1.00)