Goto

Collaborating Authors

 Education


Research Reveals the Optimal Way to Optimize

WIRED

The leading approach to the simplex method, a widely used technique for balancing complex logistical constraints, can't get any better. In 1939, upon arriving late to his statistics course at UC Berkeley, George Dantzig--a first-year graduate student--copied two problems off the blackboard, thinking they were a homework assignment. He found the homework "harder to do than usual," he would later recount, and apologized to the professor for taking some extra days to complete it. A few weeks later, his professor told him that he had solved two famous open problems in statistics. Dantzig's work would provide the basis for his doctoral dissertation and, decades later, inspiration for the film .


Would You Trust a 22-Year-Old AI Billionaire With the Global Economy?

The Atlantic - Technology

B rendan Foody is 22 years old and runs a company worth billions. This August, I met the young CEO in a glass conference room overlooking the San Francisco Bay. While his peers are searching for their first jobs, Foody is pursuing a " master plan," as he calls it, to upend the global labor market. His start-up, Mercor, offers an AI-powered hiring platform: Bots weed through rรฉsumรฉs, and even conduct interviews. In the next five years, Foody told me, AI could automate 50 percent of the tasks that people do today.


The 20 best video games of 2025

The Guardian

An arena warrior on a losing streak takes refuge in a vast forest where she discovers the joy of working in a cosy teashop. From this simple premise comes a joyful game of mindfulness and social interaction, as Alta learns how to serve up witty conversation and decent hot drinks. Colourful and highly stylised, it is a thoughtful study of burnout and recovery. An attempted-murder mystery set in an a 1920s all-girls private school reveals itself to also be an eviscerating takedown of British class politics. Witty and beautifully drawn, it is full of amusing boarding-school stereotypes, from self-interested prefects to a terrifying matron, whose motivations and personal grievances must be slowly unpicked.


Advantages and limitations in the use of transfer learning for individual treatment effects in causal machine learning

arXiv.org Machine Learning

Generalizing causal knowledge across diverse environments is challenging, especially when estimates from large-scale datasets must be applied to smaller or systematically different contexts, where external validity is critical. Model-based estimators of individual treatment effects (ITE) from machine learning require large sample sizes, limiting their applicability in domains such as behavioral sciences with smaller datasets. We demonstrate how estimation of ITEs with Treatment Agnostic Representation Networks (TARNet; Shalit et al., 2017) can be improved by leveraging knowledge from source datasets and adapting it to new settings via transfer learning (TL-TARNet; Aloui et al., 2023). In simulations that vary source and sample sizes and consider both randomized and non-randomized intervention target settings, the transfer-learning extension TL-TARNet improves upon standard TARNet, reducing ITE error and attenuating bias when a large unbiased source is available and target samples are small. In an empirical application using the India Human Development Survey (IHDS-II), we estimate the effect of mothers' firewood collection time on children's weekly study time; transfer learning pulls the target mean ITEs toward the source ITE estimate, reducing bias in the estimates obtained without transfer. These results suggest that transfer learning for causal models can improve the estimation of ITE in small samples.


2025 AAAI / ACM SIGAI Doctoral Consortium interviews compilation

AIHub

Authors pictured in order of their interview publication date (left to right, top to bottom). Each year, a small group of PhD students are chosen to participate in the AAAI/SIGAI Doctoral Consortium . This initiative provides an opportunity for the students to discuss and explore their research interests and career objectives in an interdisciplinary workshop together with a panel of established researchers. During 2025, we met with some of the students to find out more about their research and the doctoral consortium experience. Kunpeng Xu completed his PhD at the Universitรฉ de Sherbrooke and is now a postdoctoral fellow at McGill University.


Over half of deepfakes of underage victims made by classmates, Japanese police say

The Japan Times

The National Police Agency plans to warn against the obscene use of AI at delinquency-prevention lectures at schools and other events. More than half of cases reported to Japanese police of explicit deepfakes targeting those aged under 18 were created with the involvement of students from the same schools as the victims, National Police Agency data have shown. This is the first time that the NPA has released information on minors who became victims of obscene fake images created using generative artificial intelligence and other technologies. The agency plans to create flyers and warn against such use of AI at delinquency-prevention lectures at schools and other locations. According to the NPA, police were consulted over 79 cases of deepfakes targeting those up to the age of 17 from January to September this year.


A Teacher-Student Perspective on the Dynamics of Learning Near the Optimal Point

arXiv.org Machine Learning

Near an optimal learning point of a neural network, the learning performance of gradient descent dynamics is dictated by the Hessian matrix of the loss function with respect to the network parameters. We characterize the Hessian eigenspectrum for some classes of teacher-student problems, when the teacher and student networks have matching weights, showing that the smaller eigenvalues of the Hessian determine long-time learning performance. For linear networks, we analytically establish that for large networks the spectrum asymptotically follows a convolution of a scaled chi-square distribution with a scaled Marchenko-Pastur distribution. We numerically analyse the Hessian spectrum for polynomial and other non-linear networks. Furthermore, we show that the rank of the Hessian matrix can be seen as an effective number of parameters for networks using polynomial activation functions. For a generic non-linear activation function, such as the error function, we empirically observe that the Hessian matrix is always full rank.


A Bayesian latent class reinforcement learning framework to capture adaptive, feedback-driven travel behaviour

arXiv.org Machine Learning

Many travel decisions involve a degree of experience formation, where individuals learn their preferences over time. At the same time, there is extensive scope for heterogeneity across individual travellers, both in their underlying preferences and in how these evolve. The present paper puts forward a Latent Class Reinforcement Learning (LCRL) model that allows analysts to capture both of these phenomena. We apply the model to a driving simulator dataset and estimate the parameters through Variational Bayes. We identify three distinct classes of individuals that differ markedly in how they adapt their preferences: the first displays context-dependent preferences with context-specific exploitative tendencies; the second follows a persistent exploitative strategy regardless of context; and the third engages in an exploratory strategy combined with context-specific preferences.


Brie, cheddar, and other high-fat cheeses linked to lower dementia risk

Popular Science

Breakthroughs, discoveries, and DIY tips sent every weekday. It's been found in ancient human feces . The U.S. government stored 6.4 metric tons of it in mountains . And a big hunk of it played a major role in a presidential farewell party . While too much of the popular dairy product can spell tummy troubles and high cholesterol for some, new research suggests that eating more high-fat cheese and cream could be linked to a lower risk of developing dementia .