Measuring and Modifying the Readability of English Texts with GPT-4
Trott, Sean, Rivière, Pamela D.
The success of Large Language Models (LLMs) in other domains has raised the question of whether LLMs can reliably assess and manipulate the readability of text. We approach this question empirically. First, using a published corpus of 4,724 English text excerpts, we find that readability estimates produced "zero-shot" from GPT-4 Turbo and GPT-4o mini exhibit relatively high correlation with human judgments (r = 0.76 and r = 0.74, respectively), outperforming estimates derived from traditional readability formulas and various psycholinguistic indices. Then, in a pre-registered human experiment (N = 59), we ask whether Turbo can reliably make text easier or harder to read. We find evidence to support this hypothesis, though considerable variance in human judgments remains unexplained. We conclude by discussing the limitations of this approach, including its limited scope, as well as the validity of the "readability" construct and its dependence on context, audience, and goal.
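For illustration, here is a minimal sketch of how zero-shot readability estimates like these could be obtained and correlated with human judgments, assuming the OpenAI Python client. The prompt wording, rating scale, and example data are assumptions for the sketch, not the study's exact materials.

```python
# Hypothetical sketch of zero-shot readability scoring; prompt wording,
# rating scale, and data are illustrative assumptions, not the authors'.
from openai import OpenAI
from scipy.stats import pearsonr

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def rate_readability(excerpt: str, model: str = "gpt-4-turbo") -> float:
    """Ask the model for a single numeric readability rating."""
    response = client.chat.completions.create(
        model=model,
        messages=[{
            "role": "user",
            "content": (
                "On a scale from 1 (very hard to read) to 100 (very easy "
                "to read), rate the readability of the following text. "
                f"Respond with a number only.\n\n{excerpt}"
            ),
        }],
        temperature=0,
    )
    return float(response.choices[0].message.content.strip())

# Stand-ins for the published corpus and its human norms.
excerpts = [
    "The cat sat on the mat.",
    "The committee convened to review the proposal.",
    "Notwithstanding the aforementioned caveats, the protocol proceeded.",
]
human_scores = [95.0, 62.0, 28.0]
model_scores = [rate_readability(text) for text in excerpts]
r, p = pearsonr(model_scores, human_scores)
print(f"Pearson r = {r:.2f} (p = {p:.3g})")
```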
Bidirectional Transformer Representations of (Spanish) Ambiguous Words in Context: A New Lexical Resource and Empirical Analysis
Rivière, Pamela D., Beatty-Martínez, Anne L., Trott, Sean
Lexical ambiguity, where a single wordform takes on distinct, context-dependent meanings, serves as a useful tool for comparing different large language models' (LLMs') ability to form distinct, contextualized representations of the same stimulus. Few studies have systematically compared LLMs' contextualized word embeddings for languages beyond English. Here, we evaluate multiple bidirectional transformers' (BERTs') semantic representations of Spanish ambiguous nouns in context. We develop a novel dataset of minimal-pair sentences evoking the same or different senses of a target ambiguous noun. In a pre-registered study, we collect contextualized human relatedness judgments for each sentence pair. We find that various BERT-based LLMs' contextualized semantic representations capture some variance in human judgments but fall short of the human benchmark, and that for Spanish, unlike English, model scale is uncorrelated with performance. We also identify stereotyped trajectories of target noun disambiguation as a proportion of traversal through a given LLM family's architecture, which we partially replicate in English. We contribute (1) a dataset of controlled Spanish sentence stimuli with human relatedness norms, and (2) to our evolving understanding of the impact that LLM specification (architectures, training protocols) exerts on contextualized embeddings.
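As a rough illustration of the kind of measurement involved, the sketch below extracts a contextualized embedding for a target ambiguous noun from a Spanish BERT and compares two sentence contexts. The model choice (BETO) and the sentences are illustrative assumptions, not the paper's stimuli.

```python
# Minimal sketch: contextualized target-noun embeddings from a Spanish
# BERT (BETO assumed); sentences are illustrative, not dataset items.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "dccuchile/bert-base-spanish-wwm-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

def target_embedding(sentence: str, target: str) -> torch.Tensor:
    """Mean-pool the final-layer vectors of the target word's subtokens."""
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state[0]
    target_ids = tokenizer(target, add_special_tokens=False)["input_ids"]
    ids = inputs["input_ids"][0].tolist()
    for i in range(len(ids) - len(target_ids) + 1):
        if ids[i:i + len(target_ids)] == target_ids:
            return hidden[i:i + len(target_ids)].mean(dim=0)
    raise ValueError(f"{target!r} not found in {sentence!r}")

# "banco" is ambiguous between 'bench' and 'bank'; different-sense
# contexts should yield less similar embeddings than same-sense ones.
bench = target_embedding("Se sentó en el banco de la plaza.", "banco")
bank = target_embedding("Depositó el dinero en el banco.", "banco")
print(torch.cosine_similarity(bench, bank, dim=0).item())
```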
Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement
Arnett, Catherine, Rivière, Pamela D., Chang, Tyler A., Trott, Sean
The relationship between language model tokenization and performance is an open area of research. Here, we investigate how different tokenization schemes impact number agreement in Spanish plurals. We find that morphologically-aligned tokenization performs similarly to other tokenization schemes, even when induced artificially for words that would not be tokenized that way during training. We then present exploratory analyses demonstrating that language model embeddings for different plural tokenizations have similar distributions along the embedding-space axis that maximally distinguishes singular and plural nouns. Our results suggest that morphologically-aligned tokenization is a viable approach, and that existing models already generalize some morphological patterns to new items; however, morphological tokenization is not strictly required for good performance.
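For concreteness, a minimal sketch of a generic Spanish number-agreement probe with a masked language model follows; it illustrates the agreement task itself, not the paper's artificial tokenization manipulation, and the model (BETO) and test items are assumptions.

```python
# Illustrative number-agreement probe: does a plural subject make the
# plural verb form more probable at the masked position?
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

model_name = "dccuchile/bert-base-spanish-wwm-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

def candidate_logprobs(template: str, candidates: list[str]) -> dict[str, float]:
    """Log-probability of each single-token candidate at the [MASK] slot."""
    inputs = tokenizer(template, return_tensors="pt")
    mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero().item()
    with torch.no_grad():
        logits = model(**inputs).logits[0, mask_pos]
    logprobs = torch.log_softmax(logits, dim=-1)
    out = {}
    for word in candidates:
        ids = tokenizer(word, add_special_tokens=False)["input_ids"]
        assert len(ids) == 1, f"{word!r} is not a single token"
        out[word] = logprobs[ids[0]].item()
    return out

# The plural subject "Los perros" should favor plural "son" over "es".
print(candidate_logprobs("Los perros [MASK] grandes.", ["es", "son"]))
```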
Do Large Language Models know what humans know?
Trott, Sean, Jones, Cameron, Chang, Tyler, Michaelov, James, Bergen, Benjamin
Humans can attribute beliefs to others. However, it is unknown to what extent this ability results from an innate biological endowment or from experience accrued through child development, particularly exposure to language describing others' mental states. We test the viability of the language exposure hypothesis by assessing whether models exposed to large quantities of human language display sensitivity to the implied knowledge states of characters in written passages. In pre-registered analyses, we present a linguistic version of the False Belief Task to both human participants and a Large Language Model, GPT-3. Both are sensitive to others' beliefs, but while the language model significantly exceeds chance behavior, it does not perform as well as the humans, nor does it explain the full extent of their behavior -- despite being exposed to more language than a human would in a lifetime. This suggests that while statistical learning from language exposure may in part explain how humans develop the ability to reason about the mental states of others, other mechanisms are also responsible.
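To illustrate the logic of a linguistic False Belief probe, the sketch below compares a causal language model's probabilities for a belief-consistent versus a reality-consistent continuation. GPT-2 stands in for GPT-3 here, and the passage is an illustrative Sally-Anne-style item, not one of the study's stimuli.

```python
# Sketch of a False Belief probe: score the "belief" vs. "reality"
# continuation under a causal LM (GPT-2 as a stand-in for GPT-3).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

def continuation_logprob(prefix: str, continuation: str) -> float:
    """Summed log-probability of `continuation` given `prefix`."""
    prefix_ids = tokenizer(prefix, return_tensors="pt")["input_ids"]
    full_ids = tokenizer(prefix + continuation, return_tensors="pt")["input_ids"]
    with torch.no_grad():
        logits = model(full_ids).logits
    logprobs = torch.log_softmax(logits, dim=-1)
    total = 0.0
    for pos in range(prefix_ids.shape[1], full_ids.shape[1]):
        token_id = full_ids[0, pos]
        # logits at pos - 1 predict the token at pos
        total += logprobs[0, pos - 1, token_id].item()
    return total

passage = (
    "Sally puts her ball in the basket and leaves the room. While she is "
    "gone, Anne moves the ball to the box. Sally comes back and looks for "
    "her ball in the"
)
print("belief: ", continuation_logprob(passage, " basket"))
print("reality:", continuation_logprob(passage, " box"))
```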
Theoretical Concerns for the Integration of Repair
Trott, Sean, Rossano, Federico
Human conversation is messy. Speakers frequently repair their speech, and listeners must therefore integrate information across ill-formed, often fragmentary inputs. Previous dialogue systems for human-robot interaction (HRI) have addressed certain problems in dialogue repair, but many remain. In this paper, we discuss these problems from the perspective of Conversation Analysis, and argue that a more holistic account of dialogue repair will aid in the design and implementation of machine dialogue systems.
A Theoretical Model of Indirect Request Comprehension
Trott, Sean, Bergen, Benjamin
Natural human dialogue often contains ambiguous or indirect speech. This poses a unique challenge to language understanding systems because comprehension requires going beyond what is said to what is implied. In this paper, we survey related work on the particularly challenging case of understanding non-conventional indirect speech acts, then propose a more generalizable rule rooted in building a mental model of the speaker. Finally, we discuss experimental evidence pointing to the cognitive plausibility of this rule.
Natural Language Understanding and Communication for Multi-Agent Systems
Trott, Sean, Appriou, Aurélien, Feldman, Jerome, Janin, Adam
Natural Language Understanding (NLU) studies machine language comprehension and action without human intervention. We describe an implemented system that supports deep semantic NLU for controlling systems with multiple simulated robot agents. The system supports bidirectional communication for both human-agent and agent-agent interaction. This interaction is achieved through the use of N-tuples, a novel form of Agent Communication Language using shared protocols with content expressing actions or intentions. The system’s portability and flexibility are facilitated by its division into unchanging “core” and “application-specific” components.
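As a purely hypothetical sketch of what an N-tuple-style message might look like, the dataclass below pairs a shared protocol with content expressing an action or intention; every field name here is a guess for illustration, not the system's actual schema.

```python
# Hypothetical N-tuple-style message; field names are illustrative
# guesses at the kind of shared-protocol content described above.
from dataclasses import dataclass, field

@dataclass
class NTuple:
    """A structured message expressing an action or intention."""
    protocol: str                  # shared protocol both agents implement
    sender: str
    receiver: str
    predicate: str                 # the action or intention being expressed
    parameters: dict = field(default_factory=dict)

# A human instructing one of several simulated robot agents.
msg = NTuple(
    protocol="action",
    sender="human",
    receiver="robot1",
    predicate="move_to",
    parameters={"object": "box1", "speed": "slow"},
)
print(msg)
```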