Educational Setting
Teenagers in Tokyo allegedly used ChatGPT to decide extortion amount in assault case
A group of high school students arrested over allegedly trying to extort money from a boy in western Tokyo may have used ChatGPT to decide how much to demand, police said. A group of high school students in Tokyo arrested over allegedly assaulting a boy and trying to extort money from him may have used ChatGPT to decide how much to demand, media reports have recently revealed. Five teenagers, including a 17-year-old girl and four boys ranging in age from 16 to 17, were arrested in January over the alleged assault and attempted extortion of a 17-year-old high school student in the city of Hachioji in western Tokyo, according to the Metropolitan Police Department. Police said the suspects assaulted the boy in a plaza in Hachioji's Shiroyamate district, breaking his nose and causing other injuries, before allegedly trying to extort ¥150,000 ($935) from him. The girl, who was the victim's ex-girlfriend, allegedly first confronted him, accusing him of touching her younger sister's leg. She then challenged him, saying, "Give me the money or fight me one-on-one," according to reports by Fuji TV.
Contribution of task-irrelevant stimuli to drift of neural representations
Biological and artificial learners are inherently exposed to a stream of data and experience throughout their lifetimes and must constantly adapt to, learn from, or selectively ignore the ongoing input. Recent findings reveal that, even when the performance remains stable, the underlying neural representations can change gradually over time, a phenomenon known as representational drift. Studying the different sources of data and noise that may contribute to drift is essential for understanding lifelong learning in neural systems. However, a systematic study of drift across architectures and learning rules, and the connection to task, are missing. Here, in an online learning setup, we characterize drift as a function of data distribution, and specifically show that the learning noise induced by task-irrelevant stimuli, which the agent learns to ignore in a given context, can create long-term drift in the representation of task-relevant stimuli. Using theory and simulations, we demonstrate this phenomenon both in Hebbian-based learning---Oja's rule and Similarity Matching---and in stochastic gradient descent applied to autoencoders and a supervised two-layer network. We consistently observe that the drift rate increases with the variance and the dimension of the data in the task-irrelevant subspace.
Eight Predictions for the Future of Higher Education
The next decade won't be Armageddon. But it will bring a lot of change. When I started this six-week-long series by asking whether my daughter would be attending college in 2035, the year she turns eighteen, I was pretty sure I already knew the answer. Nine years is a short amount of time, and something disastrous--or, I suppose, downright liberatory--would need to happen in the culture to make it unlikely for her to shuttle off to some campus after high school. Still, thinking about the future of higher education has convinced me that her path to a bachelor's degree will be very different from the one I began in 1998.
Japan to launch language support project for foreign children
The number of public school students requiring special Japanese-language instruction reached a record high of 84,759 in fiscal 2025. The education ministry plans to launch a model project in fiscal 2027 to provide basic Japanese-language instruction for school life and classes to children of foreign nationals living in Japan. In response to an increase in the number of such children, the ministry aims to establish guidelines for effective language lessons through the project. The number of public school students requiring special Japanese-language instruction, including those who are unable to communicate adequately in daily Japanese conversation, reached a record high of 84,759 in fiscal 2025, which ended in March this year. The number doubled over the past nine years, according to the ministry. Of those students, about 10% were not given sufficient instruction at their schools due to staff shortages and other reasons.
Instead of Taking Your Job, A.I. Might Transform It
Proponents and critics of artificial intelligence often compare the technology to industrial automation--really, it's more like an intern. One summer during high school, I took a temporary job writing computer programs for a consulting firm. Each morning, I drove through rush-hour traffic to an office park near Princeton, New Jersey, on the crowded Route 1 corridor. At a desk in some sort of equipment room, I coded quick-and-dirty database tools for internal use. One of my programs simplified the process of logging hours into timesheets.
Lopez: As Compton students ace tests, educators are baffled by Rep. Maxine Waters' snub of school bond
Things to Do in L.A. Tap to enable a layout that focuses on the article. As Compton students ace tests, educators are baffled by Rep. Maxine Waters' snub of school bond Students walk on campus at Dominguez High School in Compton. A bond measure would provide millions of dollars to rebuild the school. This is read by an automated voice. Please report any issues or inconsistencies here .
Optimal Gap-Dependent Regret for Private Stochastic Decision-Theoretic Online Learning
Cesari, Tommaso, Colomboni, Roberto
We study stochastic decision-theoretic online learning with full information and event-level pure differential privacy. A COLT open problem of Hu and Mehta asks to determine the optimal gap-dependent regret rate for stochastic decision-theoretic online learning under pure event-level differential privacy. For $K$ actions, losses in $[0,1]$, and a unique best action separated from the second-best action by gap $Δ_{\min}$, the known lower bound is of order $ \frac{\log K}{\min\{Δ_{\min},\varepsilon\}}, $ or equivalently, up to universal constants, of order \[ \frac{\log K}{Δ_{\min}}+\frac{\log K}{\varepsilon}. \] We give a horizon-free pure-DP algorithm and prove the explicit regret bound \[ \operatorname{Reg}_T \le 1000 \cdot \left(\frac{\log K}{Δ_{\min}}+\frac{\log K}{\varepsilon}\right) \] for every horizon $T$. The numerical constant is not optimized. The algorithm partitions time into blocks of exponentially increasing size, plays a single action throughout each block, and chooses the next action by an exponential mechanism applied to a data-independent random prefix of the previous block. The random prefix converts block regret into a sum, over all prefix lengths, of softmax selection errors. A single entropy-potential argument controls all privacy-dominated large-gap actions at cost $\log K/\varepsilon$.
The Sample Complexity of Multiclass and Sparse Contextual Bandits
Erez, Liad, Chen, Fan, Cohen, Alon, Koren, Tomer, Mansour, Yishay, Moran, Shay, Rakhlin, Alexander
We study contextual bandits in the stochastic i.i.d.\ setting, where a learner observes contexts drawn from an unknown distribution, selects actions from a finite set $A$, and aims to identify an approximately optimal policy from a given class based on bandit feedback. Motivated by bandit multiclass classification with zero-one rewards, we focus on the \emph{$s$-sparse} setting in which, for every context, the reward vector has $L_1$-norm at most $s \ll |A|$. Our main result is the design of algorithms that, with high probability, output an $ε$-optimal policy compared to policy class $Π$ using $\tilde{O} ((s/ε^2 + |A|/ε)\log |Π|/δ)$ samples. We extend this bound to general Natarajan classes and complement it with a matching lower bound (up to logarithmic factors), thereby closing a substantial gap left by prior work (Erez et al., 2024, 2025), which incurred an additional $Θ(|A|^9)$ dependence. We obtain these results via two complementary approaches. First, we analyze contextual bandits through the lens of contextual decision making with structured observations, designing an exploration-by-optimization algorithm whose sample complexity is governed by the \emph{decision-estimation coefficient} (DEC; Foster et al., 2021, 2022). We show that, with $s$-sparse rewards, the induced model class admits a sharp DEC bound that scales with $s$ and directly yields the optimal rate. Since this approach is largely information-theoretic and involves solving complex min-max optimization problems, we also develop a second, more specialized algorithmic method based on a low-variance exploration technique. This approach leads to concrete, tractable algorithms and naturally extends to contextual combinatorial semi-bandits, leading to improved sample complexity guarantees for bandit multiclass list classification.