AITopics

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsFeb-8-2026, 02:15:43 GMT

12d286282e1be5431ea05262a21f415c-Paper-Conference.pdf

Knowledge distillation (KD) has been widely used to improve the test accuracy of a "student" network, by training it to mimic the soft probabilities of a trained

artificial intelligence, distillation, machine learning, (19 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsDec-23-2025, 23:32:21 GMT

On student-teacher deviations in distillation: does it pay to disobey?

Knowledge distillation (KD) has been widely used to improve the test accuracy of a student network, by training it to mimic the soft probabilities of a trained teacher network. Yet, it has been shown in recent work that, despite being trained to fit the teacher's probabilities, the student may not only significantly deviate from the teacher probabilities, but may also outdo than the teacher in performance. Our work aims to reconcile this seemingly paradoxical observation. Specifically, we characterize the precise nature of the student-teacher deviations, and argue how they co-occur with better generalization. First, through experiments on image and language data, we identify that these probability deviations correspond to the student systematically the confidence levels of the teacher.Next, we theoretically and empirically establish another form of exaggeration in some simple settings: KD exaggerates the implicit bias of gradient descent in converging faster along the top eigendirections of the data. Finally, we tie these two observations together: we demonstrate that the exaggerated bias of KD can simultaneously result in both (a) the exaggeration of confidence and (b) the improved generalization of the student, thus offering a resolution to the apparent paradox. Our analysis brings existing theory and practice closer by considering the role of gradient descent in KD and by demonstrating the exaggerated bias effect in both theoretical and empirical settings.

distillation, name change, student-teacher deviation, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.62)

Nudo, Jacopo, Pandolfo, Mario Edoardo, Loru, Edoardo, Samory, Mattia, Cinelli, Matteo, Quattrociocchi, Walter

Generative Exaggeration in LLM Social Agents: Consistency, Bias, and Toxicity

arXiv.org Artificial IntelligenceJul-2-2025

We investigate how Large Language Models (LLMs) behave when simulating political discourse on social media. Leveraging 21 million interactions on X during the 2024 U.S. presidential election, we construct LLM agents based on 1,186 real users, prompting them to reply to politically salient tweets under controlled conditions. Agents are initialized either with minimal ideological cues (Zero Shot) or recent tweet history (Few Shot), allowing one-to-one comparisons with human replies. We evaluate three model families (Gemini, Mistral, and DeepSeek) across linguistic style, ideological consistency, and toxicity. We find that richer contextualization improves internal consistency but also amplifies polarization, stylized signals, and harmful language. We observe an emergent distortion that we call "generation exaggeration": a systematic amplification of salient traits beyond empirical baselines. Our analysis shows that LLMs do not emulate users, they reconstruct them. Their outputs, indeed, reflect internal optimization dynamics more than observed behavior, introducing structural biases that compromise their reliability as social proxies. This challenges their use in content moderation, deliberative simulations, and policy modeling.

large language model, machine learning, natural language, (17 more...)

2507.00657

Country:

North America > United States > New York (0.04)
Europe > Italy (0.04)
Europe > Ukraine (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Voting & Elections (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Neural Information Processing SystemsMay-28-2025, 11:26:52 GMT

On student-teacher deviations in distillation: does it pay to disobey?

Knowledge distillation (KD) has been widely used to improve the test accuracy of a "student" network, by training it to mimic the soft probabilities of a trained "teacher" network. Yet, it has been shown in recent work that, despite being trained to fit the teacher's probabilities, the student may not only significantly deviate from the teacher probabilities, but may also outdo than the teacher in performance. Our work aims to reconcile this seemingly paradoxical observation. Specifically, we characterize the precise nature of the student-teacher deviations, and argue how they can co-occur with better generalization. First, through experiments on image and language data, we identify that these probability deviations correspond to the student systematically exaggerating the confidence levels of the teacher.Next, we theoretically and empirically establish another form of exaggeration in some simple settings: KD exaggerates the implicit bias of gradient descent in converging faster along the top eigendirections of the data.

artificial intelligence, machine learning, student-teacher deviation, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.45)

arXiv.org Artificial IntelligenceFeb-7-2025

The Battling Influencers Game: Nash Equilibria Structure of a Potential Game and Implications to Value Alignment

Wu, Young, Zhu, Yancheng, Cai, Jin-Yi, Zhu, Xiaojin

When multiple influencers attempt to compete for a receiver's attention, their influencing strategies must account for the presence of one another. We introduce the Battling Influencers Game (BIG), a multi-player simultaneous-move general-sum game, to provide a game-theoretic characterization of this social phenomenon. We prove that BIG is a potential game, that it has either one or an infinite number of pure Nash equilibria (NEs), and these pure NEs can be found by convex optimization. Interestingly, we also prove that at any pure NE, all (except at most one) influencers must exaggerate their actions to the maximum extent. In other words, it is rational for the influencers to be non-truthful and extreme because they anticipate other influencers to cancel out part of their influence. We discuss the implications of BIG to value alignment.

artificial intelligence, machine learning, natural language, (17 more...)

2502.01127

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Jiangsu Province > Yancheng (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.69)

Neural Information Processing SystemsOct-9-2024, 21:25:17 GMT

On student-teacher deviations in distillation: does it pay to disobey?

distillation, probability, student-teacher deviation, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.45)

arXiv.org Artificial IntelligenceApr-11-2024

A Multi-Label Dataset of French Fake News: Human and Machine Insights

Icard, Benjamin, Maine, François, Casanova, Morgane, Faye, Géraud, Chanson, Julien, Gadek, Guillaume, Atemezing, Ghislain, Bancilhon, François, Égré, Paul

We present a corpus of 100 documents, named OBSINFOX, selected from 17 sources of French press considered unreliable by expert agencies, annotated using 11 labels by 8 annotators. By collecting more labels than usual, by more annotators than is typically done, we can identify features that humans consider as characteristic of fake news, and compare them to the predictions of automated classifiers. We present a topic and genre analysis using GATE Cloud, indicative of the prevalence of satire-like text in the corpus. We then use the subjectivity analyzer VAGO, and a neural version of it, to clarify the link between ascriptions of the label Subjective and ascriptions of the label Fake News.

machine learning, natural language, subjective, (17 more...)

2403.16099

Country:

Europe > France (0.06)
North America > United States > Maine (0.04)
North America > Canada > Ontario > Toronto (0.04)
Asia (0.04)

Genre: Research Report (1.00)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

arXiv.org Artificial IntelligenceFeb-7-2024

Exposing propaganda: an analysis of stylistic cues comparing human annotations and machine classification

Faye, Géraud, Icard, Benjamin, Casanova, Morgane, Chanson, Julien, Maine, François, Bancilhon, François, Gadek, Guillaume, Gravier, Guillaume, Égré, Paul

This paper investigates the language of propaganda and its stylistic features. It presents the PPN dataset, standing for Propagandist Pseudo-News, a multisource, multilingual, multimodal dataset composed of news articles extracted from websites identified as propaganda sources by expert agencies. A limited sample from this set was randomly mixed with papers from the regular French press, and their URL masked, to conduct an annotation-experiment by humans, using 11 distinct labels. The results show that human annotators were able to reliably discriminate between the two types of press across each of the labels. We propose different NLP techniques to identify the cues used by the annotators, and to compare them with machine classification. They include the analyzer VAGO to measure discourse vagueness and subjectivity, a TF-IDF to serve as a baseline, and four different classifiers: two RoBERTa-based models, CATS using syntax, and one XGBoost combining syntactic and semantic features.

corpus, dataset, propaganda, (16 more...)

2402.0378

Country:

Asia > Russia (0.68)
Europe > France (0.15)
Europe > Ukraine > Kyiv Oblast > Kyiv (0.14)
(6 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Media > News (0.95)
Government > Regional Government > Europe Government > Russia Government (0.46)
Government > Regional Government > Asia Government > Russia Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.89)

Nagarajan, Vaishnavh, Menon, Aditya Krishna, Bhojanapalli, Srinadh, Mobahi, Hossein, Kumar, Sanjiv

On student-teacher deviations in distillation: does it pay to disobey?

arXiv.org Artificial IntelligenceAug-1-2023

Knowledge distillation (KD) has been widely-used to improve the test accuracy of a ``student'' network by training the student to mimic soft probabilities of a trained "teacher" network. Yet, it has been shown in recent work that, despite being trained to fit the teacher's probabilities, the student not only significantly deviates from these probabilities, but also performs even better than the teacher. Our work aims to reconcile this seemingly paradoxical observation by characterizing the precise nature of the student-teacher deviations, and by arguing how they can co-occur with better generalization. First, through experiments on image and language data, we identify that these deviations correspond to the student systematically exaggerating the confidence levels of the teacher. Next, we theoretically and empirically establish in some simple settings that KD also exaggerates the implicit bias of gradient descent in converging faster along the top eigendirections of the data. Finally, we demonstrate that this exaggerated bias effect can simultaneously result in both (a) the exaggeration of confidence and (b) the improved generalization of the student, thus offering a resolution to the apparent paradox. Our analysis brings existing theory and practice closer by considering the role of gradient descent in KD and by demonstrating the exaggerated bias effect in both theoretical and empirical settings.

artificial intelligence, distillation, machine learning, (18 more...)

2301.12923

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report > New Finding (0.45)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.86)