Collaborating Authors

Moritz Hardt


Characterization of Overfitting in Robust Multiclass Classification

Neural Information Processing Systems

Nonetheless, modern machine learning is adaptive in nature. Prior information about a model's performance on the test set inevitably influences


Supplementary Material for "DECAF: Generating Fair Synthetic Data Using Causally-Aware Generative Networks"

Neural Information Processing Systems

The bottom graph is a historical example of unfairness: even if there were no bias between Loan and Race, redlining (i.e., the practice of refusing a loan to people living in certain areas) would discriminate indirectly based on race [1, 2, 3, 4]. This example also shows why simply removing or not measuring a sensitive attribute does not suffice: not only does this ignore indirect bias, but hiding the protected attribute leads to an (additional) correlation between Postcode and Loan due to confounding. In Table 1, we observe that naively removing the protected attribute only ensures FTU fairness, as shown by GAN-PR, WGAN-GP-PR, and DECAF-PR. This is a direct result of the construction of generator G and follows a similar argument as Proposition 2 of [6]. Each G_i models the conditional P(X_i | {X_j : (X_j → X_i) ∈ E}); given that each G_i (see Eq. 2 of the paper) has enough capacity, G can thus express the full distribution P_X(X).
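The factorization above can be illustrated with a minimal sketch (not DECAF's actual networks): variables are sampled in topological order of a causal DAG, each conditioned only on its parents. The DAG Race → Postcode → Loan and all probabilities below are illustrative assumptions chosen to mimic the redlining example; there is no direct Race → Loan edge, yet bias flows through Postcode.

```python
import numpy as np

# Sample a causal DAG in topological order, so the joint factorizes as
# P(X) = prod_i P(X_i | parents(X_i)).  Race -> Postcode -> Loan; the
# 0.8 agreement probabilities are arbitrary illustrative choices.
rng = np.random.default_rng(0)
n = 100_000

race = rng.integers(0, 2, size=n)                              # root variable
postcode = np.where(rng.random(n) < 0.8, race, 1 - race)       # Race -> Postcode
loan = np.where(rng.random(n) < 0.8, postcode, 1 - postcode)   # Postcode -> Loan

# Loan correlates with Race despite having no direct edge from it, which is
# why dropping the sensitive attribute alone does not remove the bias.
print(round(float(np.corrcoef(race, loan)[0, 1]), 2))
```

Removing the Race column from this synthetic data would not break the Race-Loan correlation, since it is mediated entirely by Postcode; this is the indirect-bias point the passage makes.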



Look-Ahead Reasoning on Learning Platforms

Zhu, Haiqing, Zrnic, Tijana, Mendler-Dünner, Celestine

arXiv.org Machine Learning

On many learning platforms, the optimization criteria guiding model training reflect the priorities of the designer rather than those of the individuals they affect. Consequently, users may act strategically to obtain more favorable outcomes, effectively contesting the platform's predictions. While past work has studied strategic user behavior on learning platforms, the focus has largely been on strategic responses to a deployed model, without considering the behavior of other users. In contrast, look-ahead reasoning takes into account that user actions are coupled, and -- at scale -- impact future predictions. Within this framework, we first formalize level-$k$ thinking, a concept from behavioral economics, where users aim to outsmart their peers by looking one step ahead. We show that, while convergence to an equilibrium is accelerated, the equilibrium remains the same, providing no benefit of higher-level reasoning for individuals in the long run. Then, we focus on collective reasoning, where users take coordinated actions by optimizing through their joint impact on the model. By contrasting collective with selfish behavior, we characterize the benefits and limits of coordination; a new notion of alignment between the learner's and the users' utilities emerges as a key concept. We discuss connections to several related mathematical frameworks, including strategic classification, performative prediction, and algorithmic collective action.
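The abstract's headline finding about level-k thinking can be seen in a one-dimensional toy dynamic (all names and parameters below are illustrative assumptions, not the paper's model): the platform estimates a mean theta, strategic users shift the reported mean in response to the model they anticipate, and the platform refits.

```python
# Toy look-ahead dynamic.  The platform's refit map is theta -> MU + A*theta;
# level-k users best-respond to the model they expect after k further
# retraining rounds instead of the currently deployed one.
MU, A = 1.0, 0.5             # base mean and strategic sensitivity, |A| < 1
EQUILIBRIUM = MU / (1 - A)   # unique fixed point of the dynamic

def refit(theta):
    # Platform's best response to data shifted by the anticipated model.
    return MU + A * theta

def run(level, steps):
    theta = 0.0
    for _ in range(steps):
        anticipated = theta
        for _ in range(level):          # look ahead `level` retraining rounds
            anticipated = refit(anticipated)
        theta = refit(anticipated)
    return theta

# Level-1 reasoning contracts at rate A**2 instead of A: convergence is
# faster, but both dynamics reach the same equilibrium.
print(abs(run(0, 5) - EQUILIBRIUM) > abs(run(1, 5) - EQUILIBRIUM))
print(round(run(0, 60), 6), round(run(1, 60), 6))
```

In this linear toy the fixed point MU/(1-A) does not depend on the reasoning level, mirroring the claim that higher-level reasoning accelerates convergence without changing the equilibrium.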



Rip van Winkle's Razor: A Simple Estimate of Overfit to Test Data

Arora, Sanjeev, Zhang, Yi

arXiv.org Machine Learning

Traditional statistics forbids use of test data (a.k.a. holdout data) during training. Dwork et al. 2015 pointed out that current practices in machine learning, whereby researchers build upon each other's models, copying hyperparameters and even computer code, amount to implicitly training on the test set. Thus the error rate on test data may not reflect the true population error. This observation initiated adaptive data analysis, which provides evaluation mechanisms with guaranteed upper bounds on this difference. With statistical query (i.e., test accuracy) feedback, the best upper bound is fairly pessimistic: the deviation can hit a practically vacuous value if the number of models tested is quadratic in the size of the test set. In this work, we present a simple new estimate, Rip van Winkle's Razor. It relies upon a new notion of "information content" of a model: the amount of information that would have to be provided to an expert referee who is intimately familiar with the field and relevant science/math, and who has just been woken up after falling asleep at the moment of the creation of the test data (like "Rip van Winkle" of the famous fairy tale). This notion of information content is used to provide an estimate of the above deviation which is shown to be non-vacuous in many modern settings.
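To see why a description-length notion yields a non-vacuous estimate, here is a back-of-the-envelope sketch of the kind of bound it supports: a standard Hoeffding plus union bound over the at most 2**bits models the referee could decode. The exact estimate and constants in the paper may differ; this only illustrates the flavor.

```python
import math

def overfit_bound(bits, n_test, delta=0.05):
    """With probability >= 1 - delta, the test-vs-population error gap of a
    model describable in `bits` bits is at most this value (Hoeffding +
    union bound over the 2**bits decodable models)."""
    return math.sqrt((bits * math.log(2) + math.log(2 / delta)) / (2 * n_test))

# A model the referee could reconstruct from ~100 bits, scored on a
# 10,000-example test set, can overfit by at most about 6%: non-vacuous.
print(round(overfit_bound(100, 10_000), 3))
```

The key point is that the bound grows only with the square root of the description length, so even fairly elaborate architectures that are short to describe "to an expert" stay far from the vacuous regime.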


Programming Fairness in Algorithms

#artificialintelligence

"Being good is easy, what is difficult is being just." "We need to defend the interests of those whom we've never met and never will." Note: This article is intended for a general audience to try and elucidate the complicated nature of unfairness in machine learning algorithms. As such, I have tried to explain concepts in an accessible way with minimal use of mathematics, in the hope that everyone can get something out of reading this. Supervised machine learning algorithms are inherently discriminatory. They are discriminatory in the sense that they use information embedded in the features of data to separate instances into distinct categories -- indeed, this is their designated purpose in life. This is reflected in the name for these algorithms which are often referred to as discriminative algorithms (splitting data into categories), in contrast to generative algorithms (generating data from a given category). When we use supervised machine learning, this "discrimination" is used as an aid to help us categorize our data into distinct categories within the data distribution, as illustrated below. Whilst this occurs when we apply discriminative algorithms -- such as support vector machines, forms of parametric regression (e.g.



Optimal multiclass overfitting by sequence reconstruction from Hamming queries

Acharya, Jayadev, Suresh, Ananda Theertha

arXiv.org Machine Learning

A primary concern of excessive reuse of test datasets in machine learning is that it can lead to overfitting. Multiclass classification was recently shown to be more resistant to overfitting than binary classification. In an open problem of COLT 2019, Feldman, Frostig, and Hardt ask to characterize the dependence of the amount of overfitting bias with the number of classes $m$, the number of accuracy queries $k$, and the number of examples in the dataset $n$. We resolve this problem and determine the amount of overfitting possible in multi-class classification. We provide computationally efficient algorithms that achieve overfitting bias of $\tilde{\Theta}(\max\{\sqrt{{k}/{(mn)}}, k/n\})$, matching the known upper bounds.
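The rate in the abstract can be evaluated directly for illustrative parameter values (chosen here for concreteness, not taken from the paper), ignoring the polylogarithmic factors hidden in the Theta-tilde:

```python
import math

# The achievable overfitting bias max(sqrt(k/(m*n)), k/n), up to log factors.
def overfit_bias(k, m, n):
    return max(math.sqrt(k / (m * n)), k / n)

# With n = 10,000 test examples and k = 100 accuracy queries, more classes m
# shrink the achievable bias until the k/n term dominates.
for m in (2, 10, 1_000):
    print(m, round(overfit_bias(100, m, 10_000), 4))
```

This matches the qualitative claim that multiclass classification resists overfitting: the sqrt(k/(mn)) term shrinks as the number of classes m grows, with k/n as the class-independent floor.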