AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

The Role of Doctors Is Changing Forever

The New YorkerDec-19-2025, 11:00:00 GMT

Others say they don't need us. It's time for us to think of ourselves not as the high priests of health care but as what we have always been: healers. Not long ago, I cared for a middle-aged man I'll call Jim, who was generally healthy but had recently started to feel sluggish. One of his friends told him to try a hormone supplement. After Jim saw on social media that Robert F. Kennedy, Jr., the Trump Administration's Secretary of Health and Human Services, had endorsed supplements as a part of an "anti-aging" regimen, he ordered one from a telehealth company. A few months later, he noticed swelling and pain in his calf. ChatGPT warned him that he might have a blood clot.

health care, medicine, physician, (13 more...)

The New Yorker

Country:

Europe > United Kingdom (0.14)
North America > United States > Pennsylvania (0.05)
North America > United States > New York (0.05)
North America > United States > California (0.04)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Technology > Telehealth (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Social Media (0.50)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.50)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Stackelberg Learning from Human Feedback: Preference Optimization as a Sequential Game

Pásztor, Barna, Buening, Thomas Kleine, Krause, Andreas

arXiv.org Machine LearningDec-19-2025

We introduce Stackelberg Learning from Human Feedback (SLHF), a new framework for preference optimization. SLHF frames the alignment problem as a sequential-move game between two policies: a Leader, which commits to an action, and a Follower, which responds conditionally on the Leader's action. This approach decomposes preference optimization into a refinement problem for the Follower and an optimization problem against an adversary for the Leader. Unlike Reinforcement Learning from Human Feedback (RLHF), which assigns scalar rewards to actions, or Nash Learning from Human Feedback (NLHF), which seeks a simultaneous-move equilibrium, SLHF leverages the asymmetry of sequential play to capture richer preference structures. The sequential design of SLHF naturally enables inference-time refinement, as the Follower learns to improve the Leader's actions, and these refinements can be leveraged through iterative sampling. We compare the solution concepts of SLHF, RLHF, and NLHF, and lay out key advantages in consistency, data sensitivity, and robustness to intransitive preferences. Experiments on large language models demonstrate that SLHF achieves strong alignment across diverse preference datasets, scales from 0.5B to 8B parameters, and yields inference-time refinements that transfer across model families without further fine-tuning.

learning, tackelberg gda, wen 2, (13 more...)

arXiv.org Machine Learning

2512.16626

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Data Valuation for LLM Fine-Tuning: Efficient Shapley Value Approximation via Language Model Arithmetic

Tamine, Mélissa, Sakhi, Otmane, Heymann, Benjamin

arXiv.org Machine LearningDec-19-2025

Data is a critical asset for training large language models (LLMs), alongside compute resources and skilled workers. While some training data is publicly available, substantial investment is required to generate proprietary datasets, such as human preference annotations or to curate new ones from existing sources. As larger datasets generally yield better model performance, two natural questions arise. First, how can data owners make informed decisions about curation strategies and data sources investment? Second, how can multiple data owners collaboratively pool their resources to train superior models while fairly distributing the benefits? This problem, data valuation, which is not specific to large language models, has been addressed by the machine learning community through the lens of cooperative game theory, with the Shapley value being the prevalent solution concept. However, computing Shapley values is notoriously expensive for data valuation, typically requiring numerous model retrainings, which can become prohibitive for large machine learning models. In this work, we demonstrate that this computational challenge is dramatically simplified for LLMs trained with Direct Preference Optimization (DPO). We show how the specific mathematical structure of DPO enables scalable Shapley value computation. We believe this observation unlocks many applications at the intersection of data valuation and large language models.

data source, international conference, shapley value, (12 more...)

arXiv.org Machine Learning

2512.15765

Country:

Europe > Austria > Vienna (0.15)
Europe > France (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)
(9 more...)

Genre: Research Report (0.83)

Industry: Leisure & Entertainment > Games (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

China figured out how to sell EVs. Now it has to bury their batteries.

MIT Technology ReviewDec-18-2025, 21:24:58 GMT

China figured out how to sell EVs. Now it has to bury their batteries. As early electric cars age out, hundreds of thousands of used batteries are flooding the market, fueling a gray recycling economy even as Beijing and big manufacturers scramble to build a more orderly system. In August 2025, Wang Lei decided it was finally time to say goodbye to his electric vehicle. Wang, who is 39, had bought the car in 2016, when EVs still felt experimental in Beijing. It was a compact Chinese brand.

battery, china, sell ev, (14 more...)

MIT Technology Review

Country:

Asia > China > Beijing > Beijing (0.45)
North America > United States > Massachusetts (0.05)
Asia > China > Shanghai > Shanghai (0.05)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Transportation > Electric Vehicle (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.30)

Add feedback

How America Gave China an Edge in Nuclear Power

The New YorkerDec-18-2025, 20:26:06 GMT

Though the two countries are now in a race to develop atomic technology, China's most advanced reactor was the result of collaboration with American scientists. This April, in a speech given at the Shanghai branch of the Chinese Academy of Sciences, the physicist Xu Hongjie announced a breakthrough. For over a decade, his team had been working on an experimental nuclear reactor that runs on a lava-hot solution of fissile material and molten salt, rather than on solid fuel. The reactor, which went online two years ago, was a feat in itself. It is still the only one of its kind in operation in the world, and has the potential to be both safer and more efficient than the water-cooled nuclear plants that dominate the industry. Now, Xu explained, his team had been able to refuel the reactor without shutting it down, demonstrating a level of mastery over their new system. As dazzling as that was, the timing of Xu's speech also freighted the topic with geopolitical import. Only a few months earlier, DeepSeek, the Chinese artificial-intelligence company, had set alarms ringing through the U.S. tech world when it became clear that the relatively small Chinese startup, operating under U.S. export controls, had created a large language model that rivalled anything devised by the behemoths of Silicon Valley.

china, molten-salt reactor, reactor, (12 more...)

The New Yorker

Country:

North America > United States > California (0.34)
Asia > China > Shanghai > Shanghai (0.25)
North America > United States > Tennessee (0.05)
(7 more...)

Genre:

Personal (0.48)
Research Report (0.48)

Industry:

Government (1.00)
Energy > Power Industry > Utilities > Nuclear (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.54)

Add feedback

George Osborne has a new job in tech, and it doesn't bode well for Britain Chris Stokel-Walker

The GuardianDec-18-2025, 13:35:56 GMT

George Osborne has a new job in tech, and it doesn't bode well for Britain OpenAI is the latest to make a political hire as big tech spreads its tentacles around the world. Since leaving frontline politics, the former chancellor has served as the chair of the Northern Powerhouse Partnership, edited (not entirely successfully) the Evening Standard, advised asset manager BlackRock, joined boutique advisory firm Robey Warshaw, been appointed as the chair of the British Museum and taken on roles including advising crypto firm Coinbase . But Osborne's latest job is the most eye-opening - and is an alarming augur of what is to come. OpenAI, the maker of ChatGPT, has become the latest organisation to employ Osborne . He will run OpenAI for Countries, a unit tasked with working directly with governments while expanding the company's Stargate datacentre programme beyond the US.

george osborne, openai, osborne, (12 more...)

The Guardian

Country:

Europe > United Kingdom > England > Buckinghamshire > Milton Keynes (0.06)
Oceania > Australia (0.05)
North America > United States > California (0.05)
Europe > Ukraine (0.05)

Industry:

Government (1.00)
Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

OpenAI just launched an app store inside ChatGPT

EngadgetDec-18-2025, 13:30:58 GMT

Warner Bros. rejects Paramount's hostile bid Some help you pull-in locally-stored data and another lets you organize Apple Music songs. OpenAI has introduced an app directory that's now available right inside ChatGPT, the company announced. Apps extend ChatGPT conversations by bringing in new context and letting users take action like order groceries, turn an outline into a slide deck or search for an apartment, the company wrote in a blog post . OpenAI also noted in a help document that connector apps like Google Drive are now simply called apps. The new apps section (on iOS, Android and web) is divided into Feature, Lifestyle and Productivity categories, letting you connect to commonly used apps and sites like Booking.com,

app, chatgpt, openai just, (8 more...)

Engadget

Industry:

Leisure & Entertainment (0.75)
Media > Music (0.40)
Information Technology > Services (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.90)

Add feedback

The Download: the worst technology of 2025, and Sam Altman's AI hype

MIT Technology ReviewDec-18-2025, 13:10:00 GMT

Welcome to our annual list of the worst, least successful, and simply dumbest technologies of the year. We like to think there's a lesson in every technological misadventure. But when technology becomes dependent on power, sometimes the takeaway is simpler: it would have been better to stay away. Here are some of the more notable ones . Each time you've heard a borderline outlandish idea of what AI will be capable of, it often turns out that Sam Altman was, if not the first to articulate it, at least the most persuasive and influential voice behind it. For more than a decade he has been known in Silicon Valley as a world-class fundraiser and persuader.

download, mit technology review, sam altman, (14 more...)

MIT Technology Review

Country:

North America > United States > California (0.25)
Asia > China (0.08)
North America > United States > Nebraska (0.05)
(2 more...)

Industry: Energy > Power Industry (0.49)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.32)

Add feedback

The 8 worst technology flops of 2025

MIT Technology ReviewDec-18-2025, 12:00:00 GMT

The Cybertruck, sycophantic AI, and humanoid robots all made this year's list of the biggest technology failures. Welcome to our annual list of the worst, least successful, and simply dumbest technologies of the year. This year, politics was a recurring theme. Donald Trump swept back into office and used his executive pen to reshape the fortunes of entire sectors, from renewables to cryptocurrency. The wrecking-ball act began even before his inauguration, when the president-elect marketed his own memecoin, $TRUMP, in a shameless act of merchandising that, of course, we honor on this year's worst tech list. We like to think there's a lesson in every technological misadventure.

cybertruck, mit technology review, worst technology flop, (13 more...)

MIT Technology Review

Country:

North America > United States > Texas (0.05)
North America > United States > Massachusetts (0.05)
North America > United States > California > San Francisco County > San Francisco (0.05)
Europe > Germany (0.05)

Industry:

Health & Medicine > Therapeutic Area (0.98)
Government > Regional Government > North America Government > United States Government (0.49)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Robots (0.71)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.31)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.31)

Add feedback

Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction

Blondel, Mathieu, Sander, Michael E., Vivier-Ardisson, Germain, Liu, Tianlin, Roulet, Vincent

arXiv.org Machine LearningDec-18-2025

Autoregressive models (ARMs) currently constitute the dominant paradigm for large language models (LLMs). Energy-based models (EBMs) represent another class of models, which have historically been less prevalent in LLM development, yet naturally characterize the optimal policy in post-training alignment. In this paper, we provide a unified view of these two model classes. Taking the chain rule of probability as a starting point, we establish an explicit bijection between ARMs and EBMs in function space, which we show to correspond to a special case of the soft Bellman equation in maximum entropy reinforcement learning. Building upon this bijection, we derive the equivalence between supervised learning of ARMs and EBMs. Furthermore, we analyze the distillation of EBMs into ARMs by providing theoretical error bounds. Our results provide insights into the ability of ARMs to plan ahead, despite being based on the next-token prediction paradigm.

autoregressive language model, ebm, probability, (14 more...)

arXiv.org Machine Learning

2512.15605

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)

Add feedback