This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

Neural Information Processing Systems

The availability of compute and data to train larger and larger language models increases the demand for robust methods of benchmarking the true progress of LM training. Recent years witnessed significant progress in standardized benchmarking for English. Benchmarks such as GLUE, SuperGLUE, or KILT have become de facto standard tools to compare large language models. Following the trend to replicate GLUE for other languages, the KLEJ benchmark (klej is the Polish word for glue) has been released for Polish. In this paper, we evaluate the progress in benchmarking for low-resourced languages.





GLUE: Global-Local Unified Encoding for Imitation Learning via Key-Patch Tracking

Chen, Ye, Zhou, Zichen, Dou, Jianyu, Cui, Te, Yang, Yi, Yue, Yufeng

arXiv.org Artificial Intelligence

In recent years, visual representation learning has gained widespread attention in robotic imitation learning. However, in complex Out-of-Distribution (OOD) settings characterized by clutter and occlusion, the attention of global visual representations can be diluted or interfered with, leading to degraded policy performance. The invariance of local representations for task-relevant objects offers a solution. By efficiently utilizing these local representations, training and testing data can be mapped to a more similar feature space, thereby mitigating the covariate shift problem. Accordingly, we propose GLUE, a global-local unified encoding framework for imitation learning based on key-patch tracking. GLUE selects and tracks key-patches as critical local representations by employing a text-guided mechanism. It features a novel fusion framework where global patch features query local patches to distill essential information, yielding fine-grained local features with low heterogeneity relative to the global context. This fused representation steers the robot's visual attention toward task-relevant objects and preserves precise global context, which together align the training and testing distributions into a similar and task-informative feature space, ultimately enhancing the robustness of the imitation learning policy. Experiments demonstrate that GLUE achieves strong performance across diverse tasks in both simulation and real-world settings, outperforming the strongest baseline by 17.6% in simulation, 36.3% in real-world environments, and 58.3% in real-world generalization settings. The project website of GLUE is available at https://GLUE666.github.io/.
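The abstract describes a fusion step in which global patch features query local key-patch features to distill task-relevant information. The paper's actual architecture is not given here, so the sketch below is a minimal, hypothetical illustration of that query pattern as a single cross-attention step with a residual connection; the function name, the use of random matrices in place of learned Q/K/V projections, and the residual form are all assumptions, not details from the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def global_local_fusion(global_feats, local_feats, seed=0):
    """Hypothetical sketch: global patch features act as queries that
    attend over local key-patch features (keys/values), distilling
    task-relevant local information back into the global context."""
    d = global_feats.shape[-1]
    rng = np.random.default_rng(seed)
    # Random projections stand in for learned Q/K/V weight matrices.
    Wq, Wk, Wv = (rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(3))
    q = global_feats @ Wq                   # (n_global, d) queries
    k = local_feats @ Wk                    # (n_local, d) keys
    v = local_feats @ Wv                    # (n_local, d) values
    attn = softmax(q @ k.T / np.sqrt(d))    # global patches attend to local patches
    return global_feats + attn @ v          # residual fusion, same shape as input
```

The output keeps the shape of the global features, so such a fused representation could drop into a policy network wherever the original global encoding was consumed.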


RMT-KD: Random Matrix Theoretic Causal Knowledge Distillation

Ettori, Davide, Darabi, Nastaran, Senthilkumar, Sureshkumar, Trivedi, Amit Ranjan

arXiv.org Artificial Intelligence

Large deep learning models such as BERT and ResNet achieve state-of-the-art performance but are costly to deploy at the edge due to their size and compute demands. We present RMT-KD, a compression method that leverages Random Matrix Theory (RMT) for knowledge distillation to iteratively reduce network size. Instead of pruning or heuristic rank selection, RMT-KD preserves only informative directions identified via the spectral properties of hidden representations. RMT-based causal reduction is applied layer by layer with self-distillation to maintain stability and accuracy. On GLUE, AG News, and CIFAR-10, RMT-KD achieves up to 80% parameter reduction with only 2% accuracy loss, delivering 2.8x faster inference and nearly halved power consumption. These results establish RMT-KD as a mathematically grounded approach to network distillation.
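The abstract says RMT-KD keeps only "informative directions identified via the spectral properties of hidden representations". The paper's exact procedure is not given here, but a common RMT recipe for this is to compare the eigenvalues of a layer's representation covariance against the Marchenko-Pastur bulk edge and keep only directions that exceed it. The sketch below illustrates that recipe; the function name, the median-based noise-variance estimate, and the choice of threshold are assumptions for illustration, not the authors' method.

```python
import numpy as np

def rmt_informative_directions(H):
    """Hypothetical sketch: project a layer's hidden representations H
    (n samples x d units) onto the directions whose covariance
    eigenvalues exceed the Marchenko-Pastur bulk edge, i.e. directions
    unlikely to be pure noise under an RMT null model."""
    n, d = H.shape
    Hc = H - H.mean(axis=0)                     # center the features
    cov = Hc.T @ Hc / n                         # empirical covariance (d x d)
    eigvals, eigvecs = np.linalg.eigh(cov)      # ascending eigenvalues
    sigma2 = np.median(eigvals)                 # crude noise-variance estimate
    q = d / n                                   # aspect ratio of the data matrix
    mp_edge = sigma2 * (1 + np.sqrt(q)) ** 2    # MP upper bulk edge
    keep = eigvals > mp_edge                    # spikes above the noise bulk
    return eigvecs[:, keep]                     # (d x k) informative subspace
```

Applied layer by layer, the returned basis would define a reduced layer width k < d; the abstract's self-distillation step would then retrain the smaller network against the original to recover accuracy.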





OpenAI's New Ad Shows 'Reasoning' AI Making Basic Errors

TIME - Tech

OpenAI released its most advanced AI model yet, called o1, for paying users on Thursday. The launch kicked off the company's "12 Days of OpenAI" event--a dozen consecutive releases to celebrate the holiday season. OpenAI has touted o1's "complex reasoning" capabilities, and announced on Thursday that unlimited access to the model would cost $200 per month. In the video the company released to show the model's strengths, a user uploads a picture of a wooden birdhouse and asks the model for advice on how to build a similar one. The model "thinks" for a short period and then spits out what on the surface appears to be a comprehensive set of instructions. Close examination reveals the instructions to be almost useless.