e3251075554389fe91d17a794861d47b-Paper.pdf

Neural Information Processing Systems

This perspective parallels an earlier phenomenon in the much better understood field of optimization, where convexity has played a preponderant role in both theoretical and methodological advances [Nes04; Bub15].


Tight First- and Second-Order Regret Bounds for Adversarial Linear Bandits

Neural Information Processing Systems

In addition, we need only assumptions weaker than those of existing algorithms: our algorithms work on discrete action sets as well as continuous ones, without a priori knowledge about losses, and they run efficiently if a linear optimization oracle for the action set is available.
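The linear optimization oracle mentioned above is simply a routine that, given a direction vector, returns the action minimizing the inner product over the action set. A minimal sketch for a discrete action set (illustrative only, not the paper's algorithm; all names here are hypothetical):

```python
import numpy as np

def linear_opt_oracle(actions, direction):
    """Linear optimization oracle over a finite action set:
    return the action a minimizing <direction, a>.
    (Illustrative sketch for a discrete action set.)"""
    actions = np.asarray(actions, dtype=float)
    scores = actions @ direction  # inner product with each action
    return actions[int(np.argmin(scores))]

# Example: standard-basis actions in R^3 with an estimated loss vector.
actions = np.eye(3)
loss_estimate = np.array([0.5, 0.2, 0.9])
best = linear_opt_oracle(actions, loss_estimate)  # picks the cheapest coordinate
```

For continuous action sets the same interface applies, with the argmin taken over the set (e.g. by an LP solver for a polytope).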


3 Common Misunderstandings About AI in 2025

TIME - Tech

Children and parked cars are color-coded on a monitor inside a Mercedes-Benz S-Class during an autonomous driving and AI demonstration in Immendingen, Germany, on July 17, 2018. In 2025, misconceptions about AI flourished as people struggled to make sense of the rapid development and adoption of the technology. Here are three popular ones to leave behind in the New Year. When GPT-5 was released in May, people wondered (not for the first time) if AI was hitting a wall.


A Universal Law of Robustness via Isoperimetry

Neural Information Processing Systems

Classically, data interpolation with a parametrized model class is possible as long as the number of parameters is larger than the number of equations to be satisfied. A puzzling phenomenon in the current practice of deep learning is that models are trained with many more parameters than this classical theory would suggest. We propose a theoretical explanation for this phenomenon. We prove that for a broad class of data distributions and model classes, overparametrization is necessary if one wants to interpolate the data smoothly. Namely, we show that smooth interpolation requires d times more parameters than mere interpolation, where d is the ambient data dimension. We prove this universal law of robustness for any smoothly parametrized function class with polynomial-size weights and any covariate distribution verifying isoperimetry. In the case of two-layer neural networks and Gaussian covariates, this law was conjectured in prior work by Bubeck, Li, and Nagaraj. We also give an interpretation of our result as an improved generalization bound for model classes consisting of smooth functions.
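The parameter-counting claim in the abstract can be paraphrased as a scaling law; this is an informal summary (the paper's precise statement carries constants, log factors, and the isoperimetry hypothesis). With n data points in ambient dimension d and a model class with p parameters, mere interpolation is possible once p is on the order of n, but any interpolant f from such a class obeys

```latex
\mathrm{Lip}(f) \;\gtrsim\; \sqrt{\frac{nd}{p}}
\qquad\Longrightarrow\qquad
\text{$O(1)$-Lipschitz (smooth) interpolation requires } p \gtrsim n d,
```

i.e. a factor of d more parameters than are needed for interpolation alone, which is the "d times more parameters" statement in the abstract.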


How social media encourages the worst of AI boosterism

MIT Technology Review

The era of hype first, think later. Demis Hassabis, CEO of Google DeepMind, summed it up in three words: "This is embarrassing." Hassabis was replying on X to an overexcited post by Sébastien Bubeck, a research scientist at the rival firm OpenAI, announcing that two mathematicians had used OpenAI's latest large language model, GPT-5, to find solutions to 10 unsolved problems in mathematics. "Science acceleration via AI has officially begun," Bubeck crowed. Put your math hats on for a minute, and let's take a look at what this beef from mid-October was about. Bubeck was excited that GPT-5 seemed to have somehow solved a number of puzzles known as Erdős problems.


Small Language Models for Application Interactions: A Case Study

Beibin Li, Yi Zhang, Sébastien Bubeck, Jeevan Pathuri, Ishai Menache

arXiv.org Artificial Intelligence

Large Language Models (LLMs) are becoming pervasive in assisting humans with a wide variety of tasks, such as writing documents, presenting work, coding, and health assistance. Generative LLMs are being rapidly integrated into user-facing software, answering questions and increasing productivity through simple, language-based interactions with technology. One of the key operating principles behind LLMs is exploiting their ability to generalize to unseen tasks by providing examples through the prompt itself, an approach commonly known as in-context learning. While LLMs are being designed to support larger prompt sizes, processing very large prompts can be expensive and incur non-negligible latency. In this paper, we consider the alternative of using Small Language Models (SLMs), which are now being developed and open-sourced by several companies.
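The in-context learning idea described above, supplying worked input/output examples inside the prompt itself, can be sketched as a simple prompt builder. This is a generic illustration, not code from the paper; the helper name, format, and demonstration pairs are all hypothetical:

```python
def build_few_shot_prompt(examples, query):
    """Assemble an in-context-learning prompt: a few input/output
    demonstrations followed by the new query, left for the model
    to complete. (Illustrative sketch of the general technique.)"""
    blocks = [f"Input: {inp}\nOutput: {out}" for inp, out in examples]
    blocks.append(f"Input: {query}\nOutput:")
    return "\n\n".join(blocks)

# Hypothetical sentiment-labeling demonstrations.
examples = [
    ("great product, works perfectly", "positive"),
    ("arrived broken and late", "negative"),
]
prompt = build_few_shot_prompt(examples, "battery died after a day")
```

The cost concern in the abstract is visible here: every demonstration pair is re-sent (and re-processed) with each query, so prompt length, and hence latency and expense, grows with the number of in-context examples.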