AITopics | Education

Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language Models

Neural Information Processing SystemsOct-9-2025, 20:29:27 GMT

There are two updating strategies: 1) mimicking strategy to generate similar samples based on original data, preserving stylistic and contextual essence, and 2) extending strategy that further expands existing samples at varying cognitive levels by adapting Bloom's taxonomy of educational objectives.

arxiv preprint arxiv, cognitive level, dataset, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Mississippi (0.04)
Asia > Singapore (0.04)
North America > United States > Colorado > Weld County > Evans (0.04)
(3 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Education (0.88)
Information Technology (0.67)
Leisure & Entertainment > Sports > Basketball (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.92)

Add feedback

1e6dcc16ffa7ced2228d1f2fdc8b5adf-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 20:29:17 GMT

abstract state, arp, evaluation, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey (0.04)
North America > United States > Michigan (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
(3 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (0.68)
Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

A Scalable Module for Solving Top-k Problems

Neural Information Processing SystemsOct-9-2025, 20:22:30 GMT

The cost of ranking becomes significant in the new stage of deep learning.

dataset, experiment, individual loss, (17 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
North America > United States > Wisconsin (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

1da38b872e19f1f4a3c2846720e8f64a-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 20:20:48 GMT

icl, in-context learning, scenario, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan (0.04)
South America > Suriname > Paramaribo District > Paramaribo (0.04)
Europe > Liechtenstein (0.04)
(7 more...)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry:

Education (0.67)
Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
(2 more...)

Add feedback

DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection Xiao Y u 1,2, Y uang Qi

Neural Information Processing SystemsOct-9-2025, 20:19:48 GMT

Consequently, detecting whether a text is generated by LLMs has become increasingly important.

candidate text, characteristic, intrinsic characteristic, (16 more...)

Neural Information Processing Systems

Country: Asia > China > Anhui Province > Hefei (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.68)

Industry:

Information Technology (0.68)
Education (0.67)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery Alex Rutherford Michael Beukman Timon Willi Bruno Lacerda Nick Hawes Jakob Foerster University of Oxford

Neural Information Processing SystemsOct-9-2025, 20:14:14 GMT

Put differently, current methods fail to predict intuitive measures of "learnability."

agent, jaxnav, learnability, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.40)
Asia > Middle East > Jordan (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(4 more...)

Genre: Research Report > Experimental Study (0.93)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Embedding-Aligned Language Models Guy Tennenholtz

Neural Information Processing SystemsOct-9-2025, 20:13:35 GMT

In this paper, we present a novel framework which accomplishes this by exploiting latent embedding spaces to define an objective function for an LLM in an iterative RL-driven process. As an example, consider the challenge of assisting content creators in generating valuable content within a recommender ecosystem (e.g., Y ouTube, Reddit, Spotify) [Boutilier et al., 2024].

character development, neural information processing system, supplemental material, (12 more...)

Neural Information Processing Systems

Country:

Africa > Middle East > Somalia > Banaadir > Mogadishu (0.04)
North America > United States > Colorado (0.04)
Asia > Vietnam (0.04)
(7 more...)

Genre: Research Report > Experimental Study (0.92)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Education > Educational Setting (0.92)
(2 more...)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

Neural Information Processing SystemsOct-9-2025, 20:13:10 GMT

Gemini, etc.; some of these breakthroughs even seem to enable AI models to outperform human abilities in varied tasks that demand higher-order cognitive skills.

reasoning, shaded square, vlm, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Sweden > Värmland County > Karlstad (0.04)
Europe > France (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Education > Educational Setting > K-12 Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning Hao Ma

Neural Information Processing SystemsOct-9-2025, 20:11:47 GMT

Reinforcement learning (RL) has emerged as a pivotal technique for fine-tuning large language models (LLMs) on specific tasks. However, prevailing RL fine-tuning methods predominantly rely on PPO and its variants. Though these algorithms are effective in general RL settings, they often exhibit suboptimal performance and vulnerability to distribution collapse when applied to the fine-tuning of LLMs.

fine-tuning, kl divergence, task reward, (15 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > Macao (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry:

Education (0.93)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations

Neural Information Processing SystemsOct-9-2025, 20:05:56 GMT

The resulting features are evaluated on k-nearest neighbor classification over 11 datasets from vision, 5 from natural language processing, and 2 from audio.

backbone, fungus feature, gradient, (15 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(4 more...)

Add feedback

Filters

Collaborating Authors

Education

Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language Models

1e6dcc16ffa7ced2228d1f2fdc8b5adf-Paper-Conference.pdf

A Scalable Module for Solving Top-k Problems

1da38b872e19f1f4a3c2846720e8f64a-Paper-Conference.pdf

DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection Xiao Y u 1,2, Y uang Qi

No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery Alex Rutherford Michael Beukman Timon Willi Bruno Lacerda Nick Hawes Jakob Foerster University of Oxford

Embedding-Aligned Language Models Guy Tennenholtz

Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning Hao Ma

No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations