AITopics

The importance of Synthetic Data Generation (SDG) has increased significantly in domains where data quality is poor or access is limited due to privacy and regulatory constraints. One such domain is recruitment, where publicly available datasets are scarce due to the sensitive nature of information typically found in curricula vitae, such as gender, disability status, or age. This lack of accessible, representative data presents a significant obstacle to the development of fair and transparent machine learning models, particularly ranking algorithms that require large volumes of data to effectively learn how to recommend candidates. In the absence of such data, these models are prone to poor generalisation and may fail to perform reliably in real-world scenarios. Recent advances in Causal Generative Models (CGMs) offer a promising solution. CGMs enable the generation of synthetic datasets that preserve the underlying causal relationships within the data, providing greater control over fairness and interpretability in the data generation process. In this study, we present a specialised SDG method involving two CGMs: one modelling job offers and the other modelling curricula. Each model is structured according to a causal graph informed by domain expertise. We use these models to generate synthetic datasets and evaluate the fairness of candidate rankings under controlled scenarios that introduce specific biases.

data mining, machine learning, natural language, (19 more...)

2511.16204

Country: Europe > Italy (0.28)

Genre:

Instructional Material > Course Syllabus & Notes (0.48)
Research Report > New Finding (0.34)

Industry:

Law (1.00)
Education > Educational Setting (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Zeng, E. Zhixuan, Chen, Yuhao, Wong, Alexander

SCALEX: Scalable Concept and Latent Exploration for Diffusion Models

Image generation models frequently encode social biases, including stereotypes tied to gender, race, and profession. Existing methods for analyzing these biases in diffusion models either focus narrowly on predefined categories or depend on manual interpretation of latent directions. These constraints limit scalability and hinder the discovery of subtle or unanticipated patterns. W e introduce SCALEX, a framework for scalable and automated exploration of diffusion model latent spaces. SCALEX extracts semantically meaningful directions from H-space using only natural language prompts, enabling zero-shot interpretation without retraining or labelling. This allows systematic comparison across arbitrary concepts and large-scale discovery of internal model associations. W e show that SCALEX detects gender bias in profession prompts, ranks semantic alignment across identity descriptors, and reveals clustered conceptual structure without supervision. By linking prompts to latent directions directly, SCALEX makes bias analysis in diffusion models more scalable, interpretable, and extensible than prior approaches.

large language model, machine learning, natural language, (20 more...)

2511.1375

Country: North America > United States > Minnesota (0.28)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Health & Medicine (1.00)
Education > Curriculum > Subject-Specific Education (0.92)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

T2I-RiskyPrompt: A Benchmark for Safety Evaluation, Attack, and Defense on Text-to-Image Model

Zhang, Chenyu, Zhang, Tairen, Wang, Lanjun, Chen, Ruidong, Li, Wenhui, Liu, Anan

Using risky text prompts, such as pornography and violent prompts, to test the safety of text-to-image (T2I) models is a critical task. However, existing risky prompt datasets are limited in three key areas: 1) limited risky categories, 2) coarse-grained annotation, and 3) low effectiveness. To address these limitations, we introduce T2I-RiskyPrompt, a comprehensive benchmark designed for evaluating safety-related tasks in T2I models. Specifically, we first develop a hierarchical risk taxonomy, which consists of 6 primary categories and 14 fine-grained subcategories. Building upon this taxonomy, we construct a pipeline to collect and annotate risky prompts. Finally, we obtain 6,432 effective risky prompts, where each prompt is annotated with both hierarchical category labels and detailed risk reasons. Moreover, to facilitate the evaluation, we propose a reason-driven risky image detection method that explicitly aligns the MLLM with safety annotations. Based on T2I-RiskyPrompt, we conduct a comprehensive evaluation of eight T2I models, nine defense methods, five safety filters, and five attack strategies, offering nine key insights into the strengths and limitations of T2I model safety. Finally, we discuss potential applications of T2I-RiskyPrompt across various research fields.

category, large language model, machine learning, (20 more...)

2510.223

Country:

North America > United States (1.00)
Asia > Russia (0.67)
Europe (0.67)

Genre: Research Report > New Finding (0.93)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Regional Government > Europe Government (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.93)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Markowitz, David M., Taylor, Samuel Hardman

Testing Hypotheses from the Social Approval Theory of Online Hate: An Analysis of 110 Million Messages from Parler

We examined how online hate is motivated by receiving social approval via Walther's (2024) social approval theory of online hate, which argues (H1a) more signals of social approval on hate messages predicts more subsequent hate messages, and (H1b) as social approval increases, hate speech becomes more extreme. Using 110 million messages from Parler (2018-2021), we observed the number of upvotes received on a hate speech post was unassociated with hate speech in one's next post and during the next month, three-months, and six-months. The number of upvotes received on (extreme) hate speech comments, however, was positively associated with (extreme) hate speech during the next week, month, three-months, and six-months. Between-person effects revealed an average positive relationship between social approval and hate speech production at all time intervals. For comments, social approval linked more strongly to online hate than social disapproval. Social approval is a critical mechanism facilitating online hate propagation.

large language model, machine learning, natural language, (19 more...)

2507.1081

Country: North America > United States > Michigan (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (0.93)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Law > Civil Rights & Constitutional Law (0.93)
Law Enforcement & Public Safety (0.93)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

The GuardianNov-22-2025, 14:00:40 GMT

Meet the AI workers who tell their friends and family to stay away from AI

AI workers said they distrust the models they work on because of a consistent emphasis on rapid turnaround time at the expense of quality. AI workers said they distrust the models they work on because of a consistent emphasis on rapid turnaround time at the expense of quality. K rista Pawloski remembers the single defining moment that shaped her opinion on the ethics of artificial intelligence . As an AI worker on Amazon Mechanical Turk - a marketplace that allows companies to hire workers to perform tasks like entering data or matching an AI prompt with its output - Pawloski spends her time moderating and assessing the quality of AI-generated text, images and videos, as well as some factchecking. Roughly two years ago, while working from home at her dining room table, she took up a job designating tweets as racist or not. When she was presented with a tweet that read "Listen to that mooncricket sing", she almost clicked on the "no" button before deciding to check the meaning of the word "mooncricket", which, to her surprise, was a racial slur against Black Americans.

artificial intelligence, machine learning, natural language, (19 more...)

The Guardian

Country: North America > United States (0.70)

Industry:

Leisure & Entertainment > Sports (0.69)
Law (0.68)
Government > Regional Government (0.48)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.87)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

WIREDNov-22-2025, 12:00:00 GMT

The Climate Impact of Owning a Dog

My dog contributes to climate change. I've been a vegetarian for over a decade. It's not because of my health, or because I dislike the taste of chicken or beef: It's a lifestyle choice I made because I wanted to reduce my impact on the planet. And yet, twice a day, every day, I lovingly scoop a cup of meat-based kibble into a bowl and set it down for my 50-pound rescue dog, a husky mix named Loki. Until recently, I hadn't devoted a huge amount of thought to that paradox.

artificial intelligence, climate action, goldwert, (17 more...)

WIRED

Country: North America > United States (0.29)

Genre: Research Report (0.47)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
Energy (1.00)
Transportation (0.69)

Technology: Information Technology > Artificial Intelligence (1.00)

TIME - TechNov-21-2025, 17:00:00 GMT

Anthropic Study Finds AI Model 'Turned Evil' After Hacking Its Own Training

Anthropic Study Finds AI Model'Turned Evil' After Hacking Its Own Training A person holds a smartphone displaying Claude. A person holds a smartphone displaying Claude. AI models can do scary things. There are signs that they could deceive and blackmail users. Still, a common critique is that these misbehaviors are contrived and wouldn't happen in reality--but a new paper from Anthropic, released today, suggests that they really could.

artificial intelligence, large language model, natural language, (17 more...)

TIME - Tech

Industry:

Law (0.36)
Health & Medicine (0.30)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.66)

Neural Information Processing SystemsNov-21-2025, 15:53:07 GMT

Counterfactual Fairness

Machine learning can impact people with legal or ethical consequences when it is used to automate decisions in areas such as insurance, lending, hiring, and predictive policing. In many of these scenarios, previous decisions have been made that are unfairly biased against certain subpopulations, for example those of a particular race, gender, or sexual orientation. Since this past data may be biased, machine learning predictors must account for this to avoid perpetuating or creating discriminatory practices. In this paper, we develop a framework for modeling fairness using tools from causal inference. Our definition of counterfactual fairness captures the intuition that a decision is fair towards an individual if it the same in (a) the actual world and (b) a counterfactual world where the individual belonged to a different demographic group. We demonstrate our framework on a real-world problem of fair prediction of success in law school.

counterfactual fairness, electronic proceedings, name change

Neural Information Processing Systems

Industry:

Law (1.00)
Education > Educational Setting > Higher Education (0.61)
Education > Curriculum > Subject-Specific Education (0.61)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.65)

Niki Kilbertus, Mateo Rojas Carulla, Giambattista Parascandolo, Moritz Hardt, Dominik Janzing, Bernhard Schölkopf

Avoiding Discrimination through Causal Reasoning

Neural Information Processing SystemsNov-21-2025, 13:52:17 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, discrimination, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Industry: Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.41)

Neural Information Processing SystemsNov-21-2025, 12:08:42 GMT

Counterfactual Fairness

Matt J. Kusner, Joshua Loftus, Chris Russell, Ricardo Silva

In large part, the literature has focused on formalizing fairness into quantitative definitions and using them to solve a discrimination problem in a certain dataset.

artificial intelligence, fairness, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.47)

Industry:

Law (1.00)
Education (1.00)
Banking & Finance (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.93)