
Collaborating Authors

goldberg



Philly's 'transit vigilante' created a real-time bus tracker for his neighbors

Popular Science

With a sports timer and some clever coding, Max Goldberg built a DIY display that tells South Philly commuters exactly when their next bus will arrive. Philadelphia's mass transit system has had a rough go of it lately. The Pennsylvania city's main public transit provider, SEPTA, has been dealing with massive service cuts, including the elimination of entire bus routes. But South Philly resident Max Goldberg is undeterred.


Machine learning methods fail to provide cohesive atheoretical construction of personality traits from semantic embeddings

Bouguettaya, Ayoub, Stuart, Elizabeth M.

arXiv.org Artificial Intelligence

Here, we test this hypothesis using novel machine learning methods to create a bottom-up, atheoretical model of personality from the same trait-descriptive adjective list that led to the dominant contemporary model of personality (the Big Five). We then compare the descriptive utility of the resulting lexical clusters against the established Big Five personality model in how well each describes conversations online (on Reddit forums). Our analysis of 1 million online comments shows that the Big Five model provides a much more powerful and interpretable description of these communities and the differences between them. Specifically, the dimensions of Agreeableness, Conscientiousness, and Neuroticism effectively distinguish Reddit communities. In contrast, our lexical clusters do not provide meaningful distinctions and fail to describe the variation across communities. Validation against the International Personality Item Pool confirmed the Big Five model's superior psychometric coherence, and our machine learning methods notably failed to recover the trait of Extraversion. These results affirm the robustness of the Big Five, while also showing that the semantic structure of personality likely depends on social context. Our findings suggest that while machine learning can help with understanding and explaining human behavior, especially by checking the ecological validity of existing theories, machine learning methods may not be able to replace established psychological theories.
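As a toy illustration of the bottom-up, embedding-driven clustering this abstract describes (the vectors and seed words below are made up for illustration, not the authors' data or method), adjectives can be grouped by assigning each to its most similar seed in embedding space:

```python
import math

def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

# Made-up 2-D "embeddings" for trait adjectives (purely illustrative)
emb = {
    "kind": (0.9, 0.1), "friendly": (0.85, 0.2),
    "anxious": (0.1, 0.95), "nervous": (0.15, 0.9),
}

def nearest_seed(word, seeds):
    """Assign a word to the seed adjective with the most similar embedding."""
    return max(seeds, key=lambda s: cosine(emb[word], emb[s]))

clusters = {"kind": ["kind"], "anxious": ["anxious"]}
for w in ("friendly", "nervous"):
    clusters[nearest_seed(w, clusters)].append(w)
print(clusters)
```

In this sketch the clusters fall out atheoretically from vector geometry alone; the paper's point is that such clusters need not align with, or replicate, the Big Five dimensions.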


Appendix: A Dual-Stream Neural Network Explains the Functional Segregation of Dorsal and Ventral Visual Pathways in Human Brains

Choi, Minkyu

Neural Information Processing Systems

Fig. S1 displays the full set of region labels corresponding to the regions containing significant voxels from Fig. 3(a) in the main text. As detailed in Section 3.1 of the main text, our model underwent a three-stage training process. After the pre-training in Stage 1, we conducted a fine-tuning process using the learned fixations from the WhereCNN: the pre-trained WhereCNN was incorporated to guide the WhatCNN's fixations, with fixations sampled from the WhereCNN's predicted saliency maps. All training stages were conducted using four NVIDIA A40 GPUs. Figure S2: Process of determining the next fixation point given the current fixation.
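The fixation-sampling step can be sketched minimally: draw an (x, y) cell with probability proportional to its saliency value. This is a toy stand-in for sampling from the WhereCNN's predicted map, not the authors' implementation:

```python
import random

def sample_fixation(saliency, rng=random.Random(0)):
    """Sample an (x, y) fixation with probability proportional to saliency values."""
    cells = [(x, y) for y, row in enumerate(saliency) for x in range(len(row))]
    weights = [saliency[y][x] for x, y in cells]
    return rng.choices(cells, weights=weights, k=1)[0]

# All saliency mass on the bottom-right cell, so the fixation lands there
sal = [[0.0, 0.0],
       [0.0, 1.0]]
print(sample_fixation(sal))  # → (1, 1)
```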


Evaluating CxG Generalisation in LLMs via Construction-Based NLI Fine Tuning

Mackintosh, Tom, Madabushi, Harish Tayyar, Bonial, Claire

arXiv.org Artificial Intelligence

We probe large language models' ability to learn the deep form-meaning mappings defined by construction grammars. We introduce ConTest-NLI, a benchmark of 80k sentences covering eight English constructions ranging from highly lexicalized to highly schematic. Our pipeline generates diverse synthetic NLI triples via templating and a model-in-the-loop filter, supplemented by human validation to ensure challenge and label reliability. Zero-shot tests on leading LLMs reveal a 24-percentage-point drop in accuracy between naturalistic (88%) and adversarial data (64%), with schematic patterns proving hardest. Fine-tuning on a subset of ConTest-NLI yields up to a 9% improvement, yet our results highlight persistent abstraction gaps in current LLMs and offer a scalable framework for evaluating construction-informed learning.
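Template-based generation of NLI triples can be sketched as follows; the transitive template and fillers here are entirely hypothetical and far simpler than the ConTest-NLI pipeline, which covers eight constructions and adds model-in-the-loop filtering:

```python
def make_nli_triples(subject, verb_past, verb_base, obj):
    """Generate one premise with an entailed, a contradictory, and a neutral
    hypothesis from a simple transitive template (illustrative only)."""
    premise = f"{subject} {verb_past} {obj}."
    return [
        (premise, f"{subject} {verb_past} something.", "entailment"),
        (premise, f"{subject} did not {verb_base} anything.", "contradiction"),
        (premise, f"{subject} was in a hurry.", "neutral"),
    ]

triples = make_nli_triples("The chef", "sliced", "slice", "the onion")
for premise, hypothesis, label in triples:
    print(label, "|", hypothesis)
```

Scaling this idea means varying the construction itself (e.g. schematic argument-structure patterns) rather than just the lexical fillers, which is where the abstract reports models struggle.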


BEFT: Bias-Efficient Fine-Tuning of Language Models

Huang, Baichuan, Balashankar, Ananth, Aminifar, Amir

arXiv.org Artificial Intelligence

Bias-only fine-tuning has the potential for unprecedented parameter efficiency. However, the link between fine-tuning different bias terms (i.e., bias terms in the query, key, or value projections) and downstream performance remains unclear. Existing approaches, e.g., those based on the magnitude of bias change or empirical Fisher information, provide limited guidance for selecting the particular bias term for effective fine-tuning. In this paper, we propose an approach for selecting the bias term to be fine-tuned, forming the foundation of our bias-efficient fine-tuning (BEFT). We extensively evaluate our bias-efficient approach against other bias-selection approaches, across a wide range of large language models (LLMs) spanning encoder-only and decoder-only architectures from 110M to 6.7B parameters. Our results demonstrate the effectiveness and superiority of our bias-efficient approach on diverse downstream tasks, including classification, multiple-choice, and generation tasks.
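Mechanically, bias-only fine-tuning amounts to freezing every weight matrix and marking only (a subset of) bias vectors as trainable. A minimal sketch with hypothetical BERT-style parameter names (the selection criterion itself, which is BEFT's contribution, is not shown):

```python
def select_bias_params(named_params, projection=None):
    """Return names of bias terms to fine-tune.

    named_params: iterable of parameter names from a transformer.
    projection: optionally restrict to one projection ("query", "key", "value").
    """
    chosen = []
    for name in named_params:
        if not name.endswith(".bias"):
            continue  # freeze all weights; only bias vectors are candidates
        if projection is None or f".{projection}." in name:
            chosen.append(name)
    return chosen

# Hypothetical parameter names in one transformer layer
params = [
    "layer.0.attention.query.weight",
    "layer.0.attention.query.bias",
    "layer.0.attention.key.bias",
    "layer.0.attention.value.bias",
    "layer.0.output.dense.bias",
]
print(select_bias_params(params, projection="query"))
```

In a real framework one would then set `requires_grad` only on the selected parameters; the open question the paper addresses is *which* projection's bias to pick.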


Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs

Rakshit, Supantho, Goldberg, Adele

arXiv.org Artificial Intelligence

The usage-based constructionist (UCx) approach to language posits that language comprises a network of learned form-meaning pairings (constructions) whose use is largely determined by their meanings or functions, requiring them to be graded and probabilistic. This study investigates whether the internal representations in Large Language Models (LLMs) reflect the proposed function-infused gradience. We analyze representations of the English Double Object (DO) and Prepositional Object (PO) constructions in Pythia-$1.4$B, using a dataset of $5000$ sentence pairs systematically varied by human-rated preference strength for DO or PO. Geometric analyses show that the separability between the two constructions' representations, as measured by energy distance or Jensen-Shannon divergence, is systematically modulated by gradient preference strength, which depends on lexical and functional properties of sentences. That is, more prototypical exemplars of each construction occupy more distinct regions in activation space than sentences that could equally well have occurred in either construction. These results provide evidence that LLMs learn rich, meaning-infused, graded representations of constructions and offer support for using geometric measures to study representations in LLMs.
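One of the two separability measures named above, energy distance, has a compact definition: for samples X and Y it is 2·E‖X−Y‖ − E‖X−X′‖ − E‖Y−Y′‖. A minimal sketch on toy 2-D "activations" (the point clouds are invented; the paper works with Pythia-1.4B hidden states):

```python
import math

def _mean_pairwise(A, B):
    """Mean Euclidean distance over all cross pairs of two point clouds."""
    return sum(math.dist(a, b) for a in A for b in B) / (len(A) * len(B))

def energy_distance(X, Y):
    """Energy distance between two point clouds (e.g., per-sentence activations)."""
    return 2 * _mean_pairwise(X, Y) - _mean_pairwise(X, X) - _mean_pairwise(Y, Y)

# Toy clouds: a well-separated DO/PO pair scores higher than an overlapping one,
# mirroring how prototypical exemplars occupy more distinct regions
do_reps = [(0.0, 0.0), (0.1, 0.0)]
po_far  = [(5.0, 5.0), (5.1, 5.0)]
po_near = [(0.2, 0.0), (0.3, 0.0)]
print(energy_distance(do_reps, po_far) > energy_distance(do_reps, po_near))  # → True
```

Energy distance is zero for identical distributions and grows as the clouds separate, which is why it can serve as a graded separability score here.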


Robo-DM: Data Management For Large Robot Datasets

Chen, Kaiyuan, Fu, Letian, Huang, David, Zhang, Yanxiang, Chen, Lawrence Yunliang, Huang, Huang, Hari, Kush, Balakrishna, Ashwin, Xiao, Ted, Sanketi, Pannag R, Kubiatowicz, John, Goldberg, Ken

arXiv.org Artificial Intelligence

Recent results suggest that very large datasets of teleoperated robot demonstrations can be used to train transformer-based models that have the potential to generalize to new scenes, robots, and tasks. However, curating, distributing, and loading large datasets of robot trajectories, which typically consist of video, textual, and numerical modalities - including streams from multiple cameras - remains challenging. We propose Robo-DM, an efficient open-source cloud-based data management toolkit for collecting, sharing, and learning with robot data. With Robo-DM, robot datasets are stored in a self-contained format with Extensible Binary Meta Language (EBML). Robo-DM can significantly reduce the size of robot trajectory data, transfer costs, and data load time during training. Compared to the RLDS format used in OXE datasets, Robo-DM's compression saves space by up to 70x (lossy) and 3.5x (lossless). Robo-DM also accelerates data retrieval by load-balancing video decoding with memory-mapped decoding caches. Compared to LeRobot, a framework that also uses lossy video compression, Robo-DM is up to 50x faster when decoding sequentially. We physically evaluate a model trained with Robo-DM's lossy compression on a pick-and-place task using an In-Context Robot Transformer. With 75x compression of the original dataset, Robo-DM suffers no reduction in downstream task accuracy.


Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware

Yu, Justin, Fu, Letian, Huang, Huang, El-Refai, Karim, Ambrus, Rares Andrei, Cheng, Richard, Irshad, Muhammad Zubair, Goldberg, Ken

arXiv.org Artificial Intelligence

Scaling robot learning requires vast and diverse datasets. Yet the prevailing data collection paradigm, human teleoperation, remains costly and constrained by manual effort and physical robot access. We introduce Real2Render2Real (R2R2R), a novel approach for generating robot training data without relying on object dynamics simulation or teleoperation of robot hardware. The input is a smartphone-captured scan of one or more objects and a single video of a human demonstration. R2R2R renders thousands of high-visual-fidelity, robot-agnostic demonstrations by reconstructing detailed 3D object geometry and appearance and tracking 6-DoF object motion. R2R2R uses 3D Gaussian Splatting (3DGS) to enable flexible asset generation and trajectory synthesis for both rigid and articulated objects, converting these representations to meshes to maintain compatibility with scalable rendering engines like IsaacLab, with collision modeling disabled. Robot demonstration data generated by R2R2R integrates directly with models that operate on robot proprioceptive states and image observations, such as vision-language-action models (VLA) and imitation learning policies. Physical experiments suggest that models trained on R2R2R data from a single human demonstration can match the performance of models trained on 150 human teleoperation demonstrations. Project page: https://real2render2real.com


For GPT-4 as with Humans: Information Structure Predicts Acceptability of Long-Distance Dependencies

Cuneo, Nicole, Graves, Eleanor, Rakshit, Supantho, Goldberg, Adele E.

arXiv.org Artificial Intelligence

It remains debated how well any LM understands natural language or generates reliable metalinguistic judgments. Moreover, relatively little work has demonstrated that LMs can represent and respect subtle relationships between form and function proposed by linguists. We here focus on a particular such relationship established in recent work: English speakers' judgments about the information structure of canonical sentences predict independently collected acceptability ratings on corresponding 'long distance dependency' [LDD] constructions, across a wide array of base constructions and multiple types of LDDs. To determine whether any LM captures this relationship, we probe GPT-4 on the same tasks used with humans and on new extensions. Results reveal reliable metalinguistic skill on the information structure and acceptability tasks, replicating a striking interaction between the two, despite the zero-shot, explicit nature of the tasks and little to no chance of contamination [Studies 1a, 1b]. Study 2 manipulates the information structure of base sentences and confirms a causal relationship: increasing the prominence of a constituent in a context sentence increases the subsequent acceptability ratings on an LDD construction. The findings suggest a tight relationship between natural and GPT-4-generated English, and between information structure and syntax, which invites further exploration.