Stochastic LLMs do not Understand Language: Towards Symbolic, Explainable and Ontologically Based LLMs
–arXiv.org Artificial Intelligence
In our opinion the exuberance surrounding the relative success of datadriven large language models (LLMs) is slightly misguided and for several reasons (i) LLMs cannot be relied upon for factual information since for LLMs all ingested text (factual or non-factual) was created equal; (ii) due to their subsymbolic nature, whatever'knowledge' these models acquire about language will always be buried in billions of microfeatures (weights), none of which is meaningful on its own; and (iii) LLMs will often fail to make the correct inferences in several linguistic contexts (e.g., nominal compounds, copredication, quantifier scope ambiguities, intensional contexts. Since we believe the relative success of data-driven large language models (LLMs) is not a reflection on the symbolic vs. subsymbolic debate but a reflection on applying the successful strategy of a bottom-up reverse engineering of language at scale, we suggest in this paper applying the effective bottom-up strategy in a symbolic setting resulting in symbolic, explainable, and ontologically grounded language models. Keywords: Bottom-up reverse engineering of language, Symbolic large language models, Language Agnostic Ontology.
arXiv.org Artificial Intelligence
Sep-14-2023
- Country:
- North America > United States
- Arizona (0.05)
- New York > Tompkins County
- Ithaca (0.04)
- Maine > Cumberland County
- Portland (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Cambridgeshire > Cambridge (0.04)
- Spain > Galicia
- Madrid (0.06)
- Germany > North Rhine-Westphalia
- Düsseldorf Region > Düsseldorf (0.04)
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (0.40)
- Technology: