Goto

Collaborating Authors

 violin


The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models

Roll, Nathan, Kries, Jill, Jin, Flora, Wang, Catherine, Finley, Ann Marie, Sumner, Meghan, Shain, Cory, Gwilliams, Laura

arXiv.org Artificial Intelligence

Large language models (LLMs) have emerged as a candidate "model organism" for human language, offering an unprecedented opportunity to study the computational basis of linguistic disorders like aphasia. However, traditional clinical assessments are ill-suited for LLMs, as they presuppose human-like pragmatic pressures and probe cognitive processes not inherent to artificial architectures. We introduce the Text Aphasia Battery (TAB), a text-only benchmark adapted from the Quick Aphasia Battery (QAB) to assess aphasic-like deficits in LLMs. The TAB comprises four subtests: Connected Text, Word Comprehension, Sentence Comprehension, and Repetition. This paper details the TAB's design, subtests, and scoring criteria. To facilitate large-scale use, we validate an automated evaluation protocol using Gemini 2.5 Flash, which achieves reliability comparable to expert human raters (prevalence-weighted Cohen's kappa = 0.255 for model--consensus agreement vs. 0.286 for human--human agreement). We release TAB as a clinically-grounded, scalable framework for analyzing language deficits in artificial systems.


Non-Linear Model-Based Sequential Decision-Making in Agriculture

Arya, Sakshi, Lin, Wentao

arXiv.org Machine Learning

Sequential decision-making is central to sustainable agricultural management and precision agriculture, where resource inputs must be optimized under uncertainty and over time. However, such decisions must often be made with limited observations, whereas classical bandit and reinforcement learning approaches typically rely on either linear or black-box reward models that may misrepresent domain knowledge or require large amounts of data. We propose a family of nonlinear, model-based bandit algorithms that embed domain-specific response curves directly into the exploration-exploitation loop. By coupling (i) principled uncertainty quantification with (ii) closed-form or rapidly computable profit optima, these algorithms achieve sublinear regret and near-optimal sample complexity while preserving interpretability. Theoretical analysis establishes regret and sample complexity bounds, and extensive simulations emulating real-world fertilizer-rate decisions show consistent improvements over both linear and nonparametric baselines (such as linear UCB and $k$-NN UCB) in the low-sample regime, under both well-specified and shape-compatible misspecified models. Because our approach leverages mechanistic insight rather than large data volumes, it is especially suited to resource-constrained settings, supporting sustainable, inclusive, and transparent sequential decision-making across agriculture, environmental management, and allied applications. This methodology directly contributes to SDG 2 (Zero Hunger) and SDG 12 (Responsible Consumption and Production) by enabling data-driven, less wasteful agricultural practices.


Induction Heads as an Essential Mechanism for Pattern Matching in In-context Learning

Crosbie, J., Shutova, E.

arXiv.org Artificial Intelligence

As Large language models have shown a remarkable a significant milestone in this area, Elhage et al. ability to learn and perform complex tasks through (2021) demonstrated the existence of induction in-context learning (ICL) (Brown et al., 2020; Touvron heads in Transformer LMs. These heads scan the et al., 2023b). In ICL, the model receives context for previous instances of the current token a demonstration context and a query question as using a prefix matching mechanism, which identifies a prompt for prediction. Unlike supervised learning, if and where a token has appeared before. ICL utilises the pretrained model's capabilities If a matching token is found, the head employs to recognise and replicate patterns within the a copying mechanism to increase the probability demonstration context, thereby enabling accurate of the subsequent token, facilitating exact or approximate predictions for the query without the use of gradient repetition of sequences and embodying updates.


Transformer-based Joint Source Channel Coding for Textual Semantic Communication

Liu, Shicong, Gao, Zhen, Chen, Gaojie, Su, Yu, Peng, Lu

arXiv.org Artificial Intelligence

The Space-Air-Ground-Sea integrated network calls for more robust and secure transmission techniques against jamming. In this paper, we propose a textual semantic transmission framework for robust transmission, which utilizes the advanced natural language processing techniques to model and encode sentences. Specifically, the textual sentences are firstly split into tokens using wordpiece algorithm, and are embedded to token vectors for semantic extraction by Transformer-based encoder. The encoded data are quantized to a fixed length binary sequence for transmission, where binary erasure, symmetric, and deletion channels are considered for transmission. The received binary sequences are further decoded by the transformer decoders into tokens used for sentence reconstruction. Our proposed approach leverages the power of neural networks and attention mechanism to provide reliable and efficient communication of textual data in challenging wireless environments, and simulation results on semantic similarity and bilingual evaluation understudy prove the superiority of the proposed model in semantic transmission.


World champion 'speedcuber' claims the violin has aided in his success with Rubik's Cubes

FOX News

Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. A University of Michigan student is one of the world's foremost "speedcubers," a person capable of quickly solving a Rubik's Cube. He also is an accomplished violinist. Stanley Chapel says the two fields go hand in hand.


Extract fundamental frequency based on CNN combined with PYIN

Xing, Ruowei, Li, Shengchen

arXiv.org Artificial Intelligence

This paper refers to the extraction of multiple fundamental frequencies (multiple F0) based on PYIN, an algorithm for extracting the fundamental frequency (F0) of monophonic music, and a trained convolutional neural networks (CNN) model, where a pitch salience function of the input signal is produced to estimate the multiple F0. The implementation of these two algorithms and their corresponding advantages and disadvantages are discussed in this article. Analysing the different performance of these two methods, PYIN is applied to supplement the F0 extracted from the trained CNN model to combine the advantages of these two algorithms. For evaluation, four pieces played by two violins are used, and the performance of the models are evaluated accoring to the flatness of the F0 curve extracted. The result shows the combined model outperforms the original algorithms when extracting F0 from monophonic music and polyphonic music.


ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound

Lin, Yan-Bo, Lei, Jie, Bansal, Mohit, Bertasius, Gedas

arXiv.org Artificial Intelligence

We introduce an audiovisual method for long-range text-to-video retrieval. Unlike previous approaches designed for short video retrieval (e.g., 5-15 seconds in duration), our approach aims to retrieve minute-long videos that capture complex human actions. One challenge of standard video-only approaches is the large computational cost associated with processing hundreds of densely extracted frames from such long videos. To address this issue, we propose to replace parts of the video with compact audio cues that succinctly summarize dynamic audio events and are cheap to process. Our method, named ECLIPSE (Efficient CLIP with Sound Encoding), adapts the popular CLIP model to an audiovisual video setting, by adding a unified audiovisual transformer block that captures complementary cues from the video and audio streams. In addition to being 2.92x faster and 2.34x memory-efficient than long-range video-only approaches, our method also achieves better text-to-video retrieval accuracy on several diverse long-range video datasets such as ActivityNet, QVHighlights, YouCook2, DiDeMo and Charades.


Video Games Offered My Son a Haven From Bullying

WIRED

My husband and I weren't sure what started the bullying. Was it because he'd stood up to the bully who called his Black friend a "slave" and demanded he carry his cello? Our son had faced racism early--when a drunk white guy demanded his tiny 6-year-old sister return to China, where we'd adopted her. Luke stood up for her too. Whatever caused the bullying, what matters most was how he finally conquered it.

  Country: Asia > China (0.26)
  Industry: Leisure & Entertainment > Games > Computer Games (0.73)

These impossible instruments could change the future of music

MIT Technology Review

What Sassoon had heard were the early results of a curious project at the University of Edinburgh in Scotland, where Ducceschi was a researcher at the time. The Next Generation Sound Synthesis, or NESS, team had pulled together mathematicians, physicists, and computer scientists to produce the most lifelike digital music ever created, by running hyper-realistic simulations of trumpets, guitars, violins, and more on a supercomputer. Sassoon, who works with both orchestral and digital music, "trying to smash the two together," was hooked. He became a resident composer with NESS, traveling back and forth between Milan and Edinburgh for the next few years. It was a steep learning curve.


Using Artificial Intelligence in violin making

#artificialintelligence

The computer screen shows a height map of the 1716 'Messiah' Stradivari violin, taken from a 3D scan (left); and an X-ray of the ribs of the 1718 'San Lorenzo' Stradivari (right) The following extract is from The Strad's September 2021 issue feature'Violin Making and AI: Intelligent Design'. To read it in full, click here to subscribe and login. The idea that the shape and thickness of a violin's top and back plates can affect its sound is nothing new. Antonio Stradivari was undoubtedly aware of it 300 years ago, and the science behind it was scrutinised and written up at length by Carleen Hutchins in the 1950s. Since then, the phenomenon of violin'modes' and resonances has been investigated by both violin makers and academics; indeed, for many luthiers, one of the first steps in making a new instrument will be to examine its'tap tones', or speed of sound along the plates.