Goto

Collaborating Authors

 Instructional Material


The challenge of being neurodivergent in Japan's culture of conformity

The Japan Times

As awareness grows, more Japanese adults are receiving answers to struggles that went unrecognized for years. Social camouflaging can help neurodivergent people navigate social situations, but researchers say the effort often comes with significant emotional and mental strain. The first major crisis in Yosuke's life came when he stood in front of his students. Until then, the 24-year-old had navigated his life with few obstacles. He had done well in school, scored highly on IQ tests and graduated from university without any major issues. But after securing his dream job as a geography and history teacher at a girls' high school two years ago, cracks began to show. "I couldn't read the room," says Yosuke, who recalls struggling to organize course materials and wrap up classes on time.


Nested Learning: The Illusion of Deep Learning Architectures

Neural Information Processing Systems

Over the last decades, developing more powerful neural architectures and simultaneously designing optimization algorithms to effectively train them have been the core of research efforts to enhance the capability of machine learning models. Despite the recent progresses, particularly in developing Language Models (LMs), there are fundamental challenges and unanswered questions about how such models can continually learn/memorize, self-improved, and find ''effective solutions,''. In this paper, we present a new learning paradigm, called Nested Learning (NL), that coherently represents a model with a set of nested, multi-level, and/or parallel optimization problems, each of which with its own ''context flow''. NL reveals that existing deep learning methods learns from data through \emph{compressing} their own context flow, and explain how in-context learning emerges in large models. NL suggests a path (a new dimension to deep learning) to design more expressive learning algorithms with more ''levels'', resulting in higher-order in-context learning abilities. In addition to its neuroscientifically plausible and mathematically white-box nature, we advocate for its importance by presenting three core contributions: (1) Deep Optimizers: Based on NL, we show that well-known gradient-based optimizers (e.g., Adam, SGD with Momentum, etc.) are in fact associative memory modules that aim to compress the gradients with gradient descent. Building on this insight, we present a set of more expressive optimizers with deep memory and/or more powerful learning rules; (2) Self-Modifying Titans: Taking advantage of NL's insights on learning algorithms, we present a novel sequence model that learns how to modify itself by learning its own update algorithm; and (3) Continuum Memory System: We present a new formulation for memory system that generalizes the traditional viewpoint of ``long-term/short-term memory''. Combining our self-modifying sequence model with the continuum memory system, we present a learning module, called Hope, showing promising results in language modeling, continual learning, and long-context reasoning tasks.


Defining Autonomy for Wellness Robots in Senior Care

IEEE Spectrum Robotics

Download this complimentary White Paper today! This White Paper gives engineers, researchers, and care professionals an overview of how socially assistive wellness robots can support senior wellness, and how a framework can measure their autonomy. What you will learn about:ย  Why the senior care crisis exceeds incremental healthcare automation. Staffing shortages, rising dementia prevalence, and limited daily wellness programming all play a part. How the seven ICAA dimensions of wellness define a distinct category of socially assistive robot, separate from companion devices, medical devices, and general-purpose humanoids. How the Care Robot Autonomy Scale (CRAS), a six-level framework modeled on a driving-automation standard, measures autonomy across four wellness dimensions. What technical capabilities, clinical evidence, and a three-phase roadmap suggest about the path from current practice toward full wellness autonomy in the early 2030s. Click 'LOOK INSIDE' to Download Now.


Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360 Firefighting Video

Neural Information Processing Systems

Modern AI systems struggle most in environments where reliability is critical - scenes with smoke, poor visibility, and structural deformation. Each year, tens of thousands of firefighters are injured on duty, often due to breakdowns in situational perception. We introduce Fire360, a benchmark for evaluating perception and reasoning in safety-critical firefighting scenarios. The dataset includes 228 360 videos from professional training sessions under diverse conditions (e.g., low light, thermal distortion), annotated with action segments, object locations, and degradation metadata. Fire360 supports five tasks: Visual Question Answering, Temporal Action Captioning, Object Localization, Safety-Critical Reasoning, and Transformed Object Retrieval (TOR). TOR tests whether models can match pristine exemplars to fire-damaged counterparts in unpaired scenes, evaluating episodic memory under irreversible visual transformations. While human experts achieve 83.5% on TOR, models like GPT-4o lag significantly, exposing failures in reasoning under degradation. By releasing Fire360 and its evaluation suite, we aim to advance models that not only see, but also remember, reason, and act under uncertainty.


HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class

Neural Information Processing Systems

Large language models (LLMs) have shown remarkable progress in mathematical problem-solving, but evaluation has largely focused on problems that have exact analytical solutions or involve formal proofs, often overlooking approximation-based problems ubiquitous in applied science and engineering. To fill this gap, we build on prior work and present $\textbf{HARDMath2}$, a dataset of 211 original problems covering the core topics in an introductory graduate applied math class, including boundary-layer analysis, WKB methods, asymptotic solutions of nonlinear partial differential equations, and the asymptotics of oscillatory integrals. This dataset was designed and verified by the students and instructors of a core graduate applied mathematics course at Harvard. We build the dataset through a novel collaborative environment that challenges students to write and refine difficult problems consistent with the class syllabus, peer-validate solutions, test different models, and automatically check LLM-generated solutions against their own answers and numerical ground truths. Evaluation results show that leading frontier models still struggle with many of the problems in the dataset, highlighting a gap in the mathematical reasoning skills of current LLMs. Importantly, students identified strategies to create increasingly difficult problems by interacting with the models and exploiting common failure modes. This back-and-forth with the models not only resulted in a richer and more challenging benchmark but also led to qualitative improvements in the students' understanding of the course material, which is increasingly important as we enter an age where state-of-the-art language models can solve many challenging problems across a wide domain of fields.


Get officially certified in Claude AI for just 19.99

PCWorld

When you purchase through links in our articles, we may earn a small commission. Get officially certified in Claude AI for just $19.99 A Claude AI Professional E-Degree is on sale for $19.99 (reg. AI skills are no longer a nice-to-have. A verifiable credential in one of the most popular AI models on the market is a real resume differentiator, and right now, you can get an e-degree in Claude for just $19.99 (reg. While plenty of people have dabbled with Claude, there's a big difference between "I've used it a few times" and actually knowing how to make it work for you.


Multicalibration Boosting: Theory, Convergence, and Transferability

arXiv.org Machine Learning

Multicalibration extends classical calibration by requiring predictions to be unbiased over a rich collection of functions, encompassing both prediction slices and subpopulations. It has emerged as a powerful framework for fairness, robustness, and reliable prediction, yet the theoretical understanding of multicalibration boosting (MCBoost) remains fragmented and often relies on restrictive assumptions. In this work, we develop a unified and refined perspective on MCBoost that subsumes existing variants, including multiaccuracy, BatchGCP, and BatchMVP. We uncover several phenomena that provide new insights into its practical behavior: even highly accurate and flexible predictors can remain substantially miscalibrated; enforcing multicalibration introduces a calibration-risk trade-off; and early stopping plays a central role in controlling this trade-off. On the theoretical side, we establish a general framework for MCBoost under weaker and more realistic conditions. We show that the boosting iterates converge to a Bregman projection of the population-optimal predictor onto the cumulative span generated by the audit class, thereby explicitly characterizing the function space on which multicalibration is achieved. We further derive convergence rates under different smoothness assumptions, finite-sample guarantees, and principled stopping rules that ensure multicalibration at termination. Finally, we extend the theory of universal adaptability under covariate shift, providing more general transfer guarantees and clarifying when multicalibrated predictors generalize across domains. These results provide a more complete theoretical foundation and practical guidance for multicalibration boosting, positioning it as both a unifying framework and a reliable post-processing approach for modern predictive models.


LaGuardia Airport AI hologram answers traveler questions

FOX News

LaGuardia Airport's Terminal B now features an AI hologram concierge that answers traveler questions and provides step-by-step directions using real-time maps.


Concomitant DAG Learning: On the Roles of Noise Adaptivity, Sparsity, and Non-negativity

arXiv.org Machine Learning

Directed acyclic graphs (DAGs) constitute a central modeling tool to enable principled reasoning about cause-effect interactions in complex systems. However, since the causal structure underlying a group of variables is often unknown and interventions may be infeasible or ethically challenging to implement, there is a need to address the task of inferring DAGs from observational data. However, most classical structure identification approaches face two key obstacles: the combinatorial challenge of enforcing acyclicity, which severely limits scalability, and identifiability challenges arising from latent confounding or heterogeneous noise. This tutorial offers an overview of recent signal processing and optimization advances that address these issues by recasting DAG structure learning as a continuous, score-based estimation problem over adjacency matrices. We begin with a didactic introduction to structural equation models and the formulation of causal graph recovery, followed by a historical survey of score-based methods ranging from early combinatorial search schemes and greedy heuristics to modern continuous frameworks that leverage smooth characterizations of acyclicity. Building on this foundation, we describe concomitant DAG estimation methods that jointly infer sparse causal structure and exogenous noise levels, improving robustness under heteroscedasticity and distribution shifts by rendering the estimator noise adaptive. All in all, the tutorial introduces readers to challenges and opportunities for signal processing research at the crossroads of causal inference, high-dimensional statistics, and scalable graph learning, while outlining emerging directions including online, nonlinear, and neural causal discovery.


Get the newest Microsoft dev tools plus 15 coding courses -- only 50

PCWorld

When you purchase through links in our articles, we may earn a small commission. TL;DR: The Microsoft Visual Studio Professional 2026 bundle includes 15 coding courses and is on sale for $49.97 (regularly $1,999.99) This is the kind of tech purchase that tends to pay for itself pretty quickly. The Microsoft Visual Studio Professional 2026 bundle pairs one of the most widely used development environments in the industry with a full library of coding courses -- all for a single one-time payment. Visual Studio Professional 2026 has been a staple for professional developers for years, and the 2026 version pushes productivity even further.