Goto

Collaborating Authors

 cxn


Dancing with Deer: A Constructional Perspective on MWEs in the Era of LLMs

arXiv.org Artificial Intelligence

In this chapter, we argue for the benefits of understanding multiword expressions from the perspective of usage-based, construction grammar approaches. We begin with a historical overview of how construction grammar was developed in order to account for idiomatic expressions using the same grammatical machinery as the non-idiomatic structures of language. We cover a comprehensive description of constructions, which are pairings of meaning with form of any size (morpheme, word, phrase), as well as how constructional approaches treat the acquisition and generalization of constructions. We describe a successful case study leveraging constructional templates for representing multiword expressions in English PropBank. Because constructions can be at any level or unit of form, we then illustrate the benefit of a constructional representation of multi-meaningful morphosyntactic unit constructions in Arapaho, a highly polysynthetic and agglutinating language. We include a second case study leveraging constructional templates for representing these multi-morphemic expressions in Uniform Meaning Representation. Finally, we demonstrate the similarities and differences between a usage-based explanation of a speaker learning a novel multiword expression, such as "dancing with deer," and that of a large language model. We present experiments showing that both models and speakers can generalize the meaning of novel multiword expressions based on a single exposure of usage. However, only speakers can reason over the combination of two such expressions, as this requires comparison of the novel forms to a speaker's lifetime of stored constructional exemplars, which are rich with cross-modal details.


Constraining constructions with WordNet: pros and cons for the semantic annotation of fillers in the Italian Constructicon

arXiv.org Artificial Intelligence

The paper discusses the role of WordNet-based semantic classification in the formalization of constructions, and more specifically in the semantic annotation of schematic fillers, in the Italian Constructicon. We outline how the Italian Constructicon project uses Open Multilingual WordNet topics to represent semantic features and constraints of constructions.


Assessing Language Comprehension in Large Language Models Using Construction Grammar

arXiv.org Artificial Intelligence

Large Language Models, despite their significant capabilities, are known to fail in surprising and unpredictable ways. Evaluating their true `understanding' of language is particularly challenging due to the extensive web-scale data they are trained on. Therefore, we construct an evaluation to systematically assess natural language understanding (NLU) in LLMs by leveraging Construction Grammar (CxG), which provides insights into the meaning captured by linguistic elements known as constructions (Cxns). CxG is well-suited for this purpose because provides a theoretical basis to construct targeted evaluation sets. These datasets are carefully constructed to include examples which are unlikely to appear in pre-training data, yet intuitive and easy for humans to understand, enabling a more targeted and reliable assessment. Our experiments focus on downstream natural language inference and reasoning tasks by comparing LLMs' understanding of the underlying meanings communicated through 8 unique Cxns with that of humans. The results show that while LLMs demonstrate some knowledge of constructional information, even the latest models including GPT-o1 struggle with abstract meanings conveyed by these Cxns, as demonstrated in cases where test sentences are dissimilar to their pre-training data. We argue that such cases provide a more accurate test of true language understanding, highlighting key limitations in LLMs' semantic capabilities. We make our novel dataset and associated experimental data including prompts and model responses publicly available.


Annotating Constructions with UD: the experience of the Italian Constructicon

arXiv.org Artificial Intelligence

These constraint of fact, linguistic resources are mostly available can be expressed in what we call conll-c for the morphosyntactic level of analysis and the format, a newly introduced convention to maximize UD infrastructure represents the de facto standard compatibility with conll-u. Cxns formalized in this area, providing a framework which has been in conll-c can be translated into a set of employed to represent the most diverse set of languages grew queries and be used to automatically match (both typologically as well as in terms of UD-parsed sentences in order to find cxn instances.


Cambridge Audio CXN (v2) network audio streamer review: This is a sweet-sounding, high-tech musical powerhouse

PCWorld

A network audio streamer is designed to serve as a central hub for all your digital audio sources. Cambridge Audio's $1,099 CXN (v2) network audio streamer is arguably among the best you'll find. The CXN (v2) marries a super-rich feature set, copious connectivity options, broad support for multiple ecosystems, and solid audio performance into a near-perfect package that will fit into just about any scenario. Even better, regular firmware updates promise to add features and functionality, making for an investment that you'll enjoy for years to come. The CXN (v2) is designed to sit at the heart of a high-performance audio system.