AITopics | Energy

Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification

Neural Information Processing Systems | Feb-8-2026, 18:43:00 GMT

However, if error is heavy-tailed, some policies obtain arbitrarily high reward despite achieving no more utility than the base model, a phenomenon we call catastrophic Goodhart. We adapt a discrete optimization method to measure the tails of reward models, finding that they are consistent with light-tailed error.

kl divergence, machine learning, natural language, (19 more...)

Country:

North America > United States (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry:

Information Technology (0.46)
Energy (0.46)
Banking & Finance (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.84)

47561f5e1dc53c7d119185e217b523d0-Paper-Conference.pdf

Neural Information Processing Systems | Feb-8-2026, 16:57:07 GMT

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry:

Information Technology (0.46)
Energy (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

The world's smallest sea turtle lives in a noisy ocean

Popular Science | Feb-8-2026, 15:19:00 GMT

Noisy ships and industry are impacting critically endangered Kemp's ridley sea turtles. For the world's smallest sea turtles, life in the ocean is getting pretty noisy. These relatively little turtles (on average they're still 75 to 100 pounds), mostly found in the Gulf of Mexico, already face fishing gear accidents, seacraft collisions, plastic pollution, and habitat deterioration, and now excess noise may be harming the critically endangered and rare Kemp's ridley sea turtles. We say "may" because even though these sea turtles share waters with extremely busy shipping lanes, scientists know very little about their underwater hearing.

artificial intelligence, sea turtle, turtle, (11 more...)

Country:

North America > United States (0.35)
North America > Mexico (0.25)
Atlantic Ocean > Gulf of Mexico (0.25)

Genre: Research Report > New Finding (0.51)

Industry:

Energy (0.71)
Transportation > Marine (0.35)

Technology: Information Technology > Artificial Intelligence (0.51)

SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model: Supplementary Material Di Wang

Neural Information Processing Systems | Feb-8-2026, 13:44:08 GMT

In addition, only the foreground categories are considered.

artificial intelligence, dataset, machine learning, (16 more...)

Country:

Europe > Germany > Brandenburg > Potsdam (0.05)
Asia > China > Hubei Province > Wuhan (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)

Industry:

Transportation (1.00)
Leisure & Entertainment > Sports (1.00)
Law (0.68)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.44)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.69)

4054556fcaa934b0bf76da52cf4f92cb-Paper-Conference.pdf

Neural Information Processing Systems | Feb-8-2026, 13:14:39 GMT

de-stationary attention, forecasting, transformer, (13 more...)

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Asia > China > Beijing > Beijing (0.04)

Industry:

Health & Medicine (1.00)
Energy (0.67)

Technology:

Information Technology > Data Science (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

57e5cb96e22546001f1d6520ff11d9ba-Paper.pdf

Neural Information Processing Systems | Feb-8-2026, 12:25:03 GMT

arxiv preprint arxiv, learning, trajectory, (11 more...)

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
North America > Canada (0.04)

Industry:

Energy (0.93)
Transportation (0.68)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

51311013e51adebc3c34d2cc591fefee-Paper.pdf

Neural Information Processing Systems | Feb-8-2026, 10:27:52 GMT

gradient, optimization problem, prediction, (16 more...)

Country:

North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Belgium > Flanders (0.04)

Industry:

Energy (1.00)
Banking & Finance > Real Estate (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Automatically Generated Code

Neural Information Processing Systems | Feb-8-2026, 09:35:36 GMT

Code generation models have increasingly become integral to aiding software development.

gpt-3, large language model, machine learning, (19 more...)

Country:

Europe > Austria > Vienna (0.14)
Africa > Rwanda > Kigali > Kigali (0.04)
North America > Canada > Ontario > Toronto (0.04)
(7 more...)

Genre: Research Report > New Finding (0.67)

Industry: Energy (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Software (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization

Neural Information Processing Systems | Feb-8-2026, 08:45:35 GMT

Bayesian optimization (BO) conventionally relies on handcrafted acquisition functions (AFs) to sequentially determine the sample points. However, it has been widely observed in practice that the best-performing AF in terms of regret can vary significantly under different types of black-box functions. It has remained a challenge to design one AF that can attain the best performance over a wide variety of black-box functions.

machine learning, optimization, reinforcement learning, (18 more...)

Country: