AITopics | Energy

Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification

Neural Information Processing SystemsOct-9-2025, 19:56:40 GMT

However, if error is heavy-tailed, some policies obtain arbitrarily high reward despite achieving no more utility than the base model-a phenomenon we call catastrophic Goodhart. We adapt a discrete optimization method to measure the tails of reward models, finding that they are consistent with light-tailed error.

goodhart, kl divergence, optimization, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry:

Information Technology (0.46)
Energy (0.46)
Banking & Finance (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.84)

Add feedback

MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation Jialin Luo 1,, Y uanzhi Wang

Neural Information Processing SystemsOct-9-2025, 19:26:03 GMT

Extensive experimental results verify that our proposed MMM-RS dataset allows off-the-shelf diffusion models to generate diverse RS images across various modalities, scenes, weather conditions, and GSD.

dataset, mmm-rs dataset, text prompt, (13 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > China > Beijing > Beijing (0.04)
Asia > China > Shandong Province (0.04)

Genre: Research Report > New Finding (0.94)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

15807b6e09d691fe5e96cdecde6d7b80-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsOct-9-2025, 19:17:32 GMT

canonical solution, efficiency, gpt-3, (9 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
Africa > Rwanda > Kigali > Kigali (0.04)
North America > Canada > Ontario > Toronto (0.04)
(10 more...)

Genre: Research Report > New Finding (0.93)

Industry: Energy (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.99)

Add feedback

12c118ef87fde56a10bd858842781b34-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 18:58:11 GMT

basis function, charge density, prediction, (14 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry:

Energy (0.46)
Materials > Chemicals > Commodity Chemicals > Petrochemicals (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning Wuyang Chen Simon Fraser University Jialin Song

Neural Information Processing SystemsOct-9-2025, 18:19:45 GMT

Extensive empirical evaluations on a diverse set of PDEs demonstrate that our method is highly data-efficient, more gener-alizable, and even outperforms conventional vision-pretrained models.

equation, neural operator, unlabeled pde data, (12 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry: Energy (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Federated Ensemble-Directed Offline Reinforcement Learning

Neural Information Processing SystemsOct-9-2025, 18:19:27 GMT

We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policy only using small pre-collected datasets generated according to different unknown behavior policies. Naïvely combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. In response, we develop the Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We develop the FEDORA codebase to utilize distributed compute resources on a federated learning platform. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control environments and real-world datasets.

dataset, federation, fedora, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.04)
North America > United States > Virginia (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry:

Energy (0.93)
Information Technology > Security & Privacy (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

080be5eb7e887319ff30c792c2cbc28c-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 17:52:16 GMT

dataset, domain generalization, generalization, (13 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry:

Government (0.46)
Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Add feedback

Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices

Neural Information Processing SystemsOct-9-2025, 17:44:40 GMT

In contrast to the standard sparse MoE for each entire feed-forward network, BTT -MoE learns an MoE in every single linear layer of the model, including the projection matrices in the attention blocks.

einsum, experiment, matrix, (13 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.93)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

SolarCube: Supplementary Information

Neural Information Processing SystemsOct-9-2025, 17:43:08 GMT

Hierarchical vision transformer using shifted windows," in Proceedings of the IEEE/CVF

dataset, dimension, study area, (15 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Oceania > Australia (0.05)
North America > United States > Maryland (0.05)
(6 more...)

Genre: Research Report (0.68)

Industry: Energy > Renewable > Solar (0.73)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)
Information Technology > Artificial Intelligence > Vision (0.86)

Add feedback

SolarCube: An Integrative Benchmark Dataset Harnessing Satellite and In-situ Observations for Large-scale Solar Energy Forecasting

Neural Information Processing SystemsOct-9-2025, 17:43:04 GMT

However, the cloud induced-variability of solar radiation reaching the earth's surface presents a challenge for integrating solar power into the grid (e.g., storage and backup management).

forecasting, radiation, solar radiation, (15 more...)

Neural Information Processing Systems

Country: