AITopics

2508.15239

Country: Asia > Thailand (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Law > Statutes (1.00)
Law > Civil Rights & Constitutional Law (1.00)
Information Technology > Security & Privacy (1.00)
(9 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Oozeer, Narmeen, Marks, Luke, Barez, Fazl, Abdullah, Amirali

Beyond Linear Steering: Unified Multi-Attribute Control for Language Models

arXiv.org Artificial IntelligenceSep-22-2025

Controlling multiple behavioral attributes in large language models (LLMs) at inference time is a challenging problem due to interference between attributes and the limitations of linear steering methods, which assume additive behavior in activation space and require per-attribute tuning. We introduce K-Steering, a unified and flexible approach that trains a single non-linear multi-label classifier on hidden activations and computes intervention directions via gradients at inference time. This avoids linearity assumptions, removes the need for storing and tuning separate attribute vectors, and allows dynamic composition of behaviors without retraining. To evaluate our method, we propose two new benchmarks, ToneBank and DebateMix, targeting compositional behavioral control. Empirical results across 3 model families, validated by both activation-based classifiers and LLM-based judges, demonstrate that K-Steering outperforms strong baselines in accurately steering multiple behaviors.

large language model, machine learning, natural language, (18 more...)

2505.24535

Country: Europe > United Kingdom (0.46)

Genre: Research Report > New Finding (1.00)

Industry:

Law > Statutes (1.00)
Law > Civil Rights & Constitutional Law (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Al JazeeraSep-21-2025, 20:48:37 GMT

US children among five killed in Israeli drone strike on southern Lebanon

Why is Israel still in southern Lebanon? A war to shape Lebanon's future An Israeli drone strike has killed five people, including three children, in the southern Lebanese town of Bint Jbeil, Lebanon's Health Ministry has said, as Israel continues to target its neighbour despite a US-brokered truce that took effect in November. The state-run National News Agency (NNA) reported on Sunday that the strike targeted a motorcycle and a vehicle, and wounded two other people. Why then did Israel attack Syria? The mother of the children was injured in the attack.

israeli drone strike, lebanon, southern lebanon, (9 more...)

Al Jazeera

Country:

Asia > Middle East > Lebanon (1.00)
Asia > Middle East > Israel (0.96)
Asia > Middle East > Syria (0.27)
(9 more...)

Industry:

Government > Military (0.74)
Information Technology > Robotics & Automation (0.63)
Law > International Law (0.51)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.63)

The GuardianSep-21-2025, 11:00:43 GMT

Chatbot site depicting child sexual abuse images raises fears over misuse of AI

The IWF said it had been alerted to a chatbot site that offered scenarios including'child prostitute in a hotel' and'child and teacher alone after class'. The IWF said it had been alerted to a chatbot site that offered scenarios including'child prostitute in a hotel' and'child and teacher alone after class'. A chatbot site offering explicit scenarios with preteen characters, illustrated by illegal abuse images has raised fresh fears about the misuse of artificial intelligence. A report by a child safety watchdog has triggered calls for the UK government to impose safety guidelines on AI companies, amid a surge in child sexual abuse material (CSAM) created by the technology. The Internet Watch Foundation said it had been alerted to a chatbot site that offered a number of scenarios including "child prostitute in a hotel", "sex with your child while your wife is on holiday" and "child and teacher alone after class".

iwf, scenario, sexual abuse image raise fear, (12 more...)

The Guardian

Country:

Europe > United Kingdom (1.00)
North America > United States (0.17)
Europe > Ukraine (0.06)
(2 more...)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
Government > Regional Government > Europe Government > United Kingdom Government (0.73)
Health & Medicine > Therapeutic Area > Pediatrics/Neonatology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.86)

WIREDSep-19-2025, 10:30:00 GMT

Meta Accused of Torrenting Porn to Advance Its Goal of AI 'Superintelligence'

The complaint, filed in July, alleges Meta has been torrenting and seeding Strike 3's videos since 2018. Associated exhibits and details of the complaint were unsealed last week. Strike 3 alleges Meta's motive was partly to obtain otherwise difficult to scrape visual angles, parts of the human body, and extended, uninterrupted scenes--rare in mainstream movies and TV--to help it create what Mark Zuckerberg calls AI "superintelligence." "They have an interest in getting our content because it can give them a competitive advantage for the quality, fluidity, and humanity of the AI," alleges Christian Waugh, an attorney for Strike 3. This process made Strike 3's porn videos accessible to minors, the complaint alleges, since BitTorrent does not have age verification.

ai model, meta, strike 3, (13 more...)

WIRED

Country:

Asia (0.15)
South America (0.05)
North America > United States > New York (0.05)
(4 more...)

Industry: Law > Litigation (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.49)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.30)

The Japan TimesSep-19-2025, 02:01:00 GMT

ChatGPT was used 'to help scammers do their thing' at Asia fraud compound

ChatGPT was used'to help scammers do their thing' at Asia fraud compound ChatGPT owner OpenAI says it actively works to identify and disrupt scam-related misuse of ChatGPT." Duncan Okindo says he was lured to Southeast Asia last year by the promise of a customer service job in Thailand. Instead, he ended up spending four months in a scam compound on the lawless Myanmar-Thai border, where he saw first-hand how criminal groups are at scale. Okindo, 26, says he was struggling to find a job as the breadwinner for his family in his native Kenya when a local recruitment agency promised him work in Bangkok. The flight was his first trip overseas. On landing, he says, he was abducted at the airport and spirited across the border, into the notorious KK Park complex, guarded by heavily armed men and fortified like it was meant for war."

chatgpt, compound, help scammer, (10 more...)

The Japan Times

Country:

Asia > Thailand > Bangkok > Bangkok (0.25)
Asia > Southeast Asia (0.25)
Asia > Myanmar (0.25)
(6 more...)

Industry:

Information Technology > Security & Privacy (0.73)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.62)
Law > Criminal Law (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Hu, Botao Amber, Liu, Yuhan, Rong, Helena

Trustless Autonomy: Understanding Motivations, Benefits, and Governance Dilemmas in Self-Sovereign Decentralized AI Agents

The recent trend of self-sovereign Decentralized AI Agents (DeAgents) combines Large Language Model (LLM)-based AI agents with decentralization technologies such as blockchain smart contracts and trusted execution environments (TEEs). These tamper-resistant trustless substrates allow agents to achieve self-sovereignty through ownership of cryptowallet private keys and control of digital assets and social media accounts. DeAgents eliminate centralized control and reduce human intervention, addressing key trust concerns inherent in centralized AI systems. This contributes to social computing by enabling new human cooperative paradigm "intelligence as commons." However, given ongoing challenges in LLM reliability such as hallucinations, this creates paradoxical tension between trustlessness and unreliable autonomy. This study addresses this empirical research gap through interviews with DeAgents stakeholders-experts, founders, and developers-to examine their motivations, benefits, and governance dilemmas. The findings will guide future DeAgents system and protocol design and inform discussions about governance in sociotechnical AI systems in the future agentic web.

large language model, machine learning, natural language, (18 more...)

2505.09757

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)

Genre: Research Report > New Finding (0.93)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Banking & Finance > Trading (1.00)
Social Sector (0.93)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Zheng, Ying, Jiang, Yangfan, Tan, Kian-Lee

CausalPre: Scalable and Effective Data Pre-processing for Causal Fairness

Abstract--Causal fairness in databases is crucial to preventing biased and inaccurate outcomes in downstream tasks. While most prior work assumes a known causal model, recent efforts relax this assumption by enforcing additional constraints. However, these approaches often fail to capture broader attribute relationships that are critical to maintaining utility. This raises a fundamental question: Can we harness the benefits of causal reasoning to design efficient and effective fairness solutions without relying on strong assumptions about the underlying causal model? In this paper, we seek to answer this question by introducing CausalPre, a scalable and effective causality-guided data pre-processing framework that guarantees justifiable fairness, a strong causal notion of fairness. CausalPre extracts causally fair relationships by reformulating the originally complex and computationally infeasible extraction task into a tailored distribution estimation problem. T o ensure scalability, CausalPre adopts a carefully crafted variant of low-dimensional marginal factorization to approximate the joint distribution, complemented by a heuristic algorithm that efficiently tackles the associated computational challenge. Extensive experiments on benchmark datasets demonstrate that CausalPre is both effective and scalable, challenging the conventional belief that achieving causal fairness requires trading off relationship coverage for relaxed model assumptions. Machine learning (ML) systems are increasingly integrated into decision-making processes in domains such as education [1], finance [2], employment [3], advertising [4], and law enforcement [5], [6]. While these systems offer efficiency and scalability, they also pose serious concerns about fairness [7]- [14]. In particular, their reliance on historical data can unintentionally amplify biases, producing inaccurate, discriminatory outcomes with severe real-world impacts in high-stakes areas like criminal justice. These concerns have motivated the development of fairness-aware data pre-processing techniques within database management systems (DBMS) [15]-[22]. Compared to traditional fairness interventions at the model training or inference stages [23]-[28], pre-processing methods offer: (i) a once-for-all benefit, meaning that once data is calibrated for fairness, it can be used in any downstream task, regardless of the ML model employed; and (ii) a user-friendly workflow, as fairness considerations are directly embedded into the data pre-processing pipeline, enabling practitioners to focus on the downstream task without specialized fairness expertise. A straightforward approach to achieve this is to remove all sensitive attributes (e.g., gender and race) from the training data. However, such ad hoc solutions often fail in practice, as non-sensitive attributes may act as proxies for sensitive ones, particularly when strong correlations exist [18], [29].

artificial intelligence, data quality, machine learning, (20 more...)

2509.15199

Genre: Research Report > New Finding (0.67)

Industry:

Law (0.68)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.34)

Technology:

Information Technology > Data Science > Data Quality > Data Cleaning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Emergent Alignment via Competition

Collina, Natalie, Goel, Surbhi, Roth, Aaron, Ryu, Emily, Shi, Mirah

Aligning AI systems with human values remains a fundamental challenge, but does our inability to create perfectly aligned models preclude obtaining the benefits of alignment? We study a strategic setting where a human user interacts with multiple differently misaligned AI agents, none of which are individually well-aligned. Our key insight is that when the users utility lies approximately within the convex hull of the agents utilities, a condition that becomes easier to satisfy as model diversity increases, strategic competition can yield outcomes comparable to interacting with a perfectly aligned model. We model this as a multi-leader Stackelberg game, extending Bayesian persuasion to multi-round conversations between differently informed parties, and prove three results: (1) when perfect alignment would allow the user to learn her Bayes-optimal action, she can also do so in all equilibria under the convex hull condition (2) under weaker assumptions requiring only approximate utility learning, a non-strategic user employing quantal response achieves near-optimal utility in all equilibria and (3) when the user selects the best single AI after an evaluation period, equilibrium guarantees remain near-optimal without further distributional assumptions. We complement the theory with two sets of experiments.

artificial intelligence, machine learning, natural language, (20 more...)

2509.1509

Country: Europe (0.14)

Genre: Research Report (1.00)

Industry:

Media > Film (1.00)
Law (0.93)
Leisure & Entertainment > Games (0.67)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.67)

Gosmar, Diego, Dahl, Deborah A.

Sentinel Agents for Secure and Trustworthy Agentic AI in Multi-Agent Systems

This paper proposes a novel architectural framework aimed at enhancing security and reliability in multi-agent systems (MAS). A central component of this framework is a network of Sentinel Agents, functioning as a distributed security layer that integrates techniques such as semantic analysis via large language models (LLMs), behavioral analytics, retrieval-augmented verification, and cross-agent anomaly detection. Such agents can potentially oversee inter-agent communications, identify potential threats, enforce privacy and access controls, and maintain comprehensive audit records. Complementary to the idea of Sentinel Agents is the use of a Coordinator Agent. The Coordinator Agent supervises policy implementation, and manages agent participation. In addition, the Coordinator also ingests alerts from Sentinel Agents. Based on these alerts, it can adapt policies, isolate or quarantine misbehaving agents, and contain threats to maintain the integrity of the MAS ecosystem. This dual-layered security approach, combining the continuous monitoring of Sentinel Agents with the governance functions of Coordinator Agents, supports dynamic and adaptive defense mechanisms against a range of threats, including prompt injection, collusive agent behavior, hallucinations generated by LLMs, privacy breaches, and coordinated multi-agent attacks. In addition to the architectural design, we present a simulation study where 162 synthetic attacks of different families (prompt injection, hallucination, and data exfiltration) were injected into a multi-agent conversational environment. The Sentinel Agents successfully detected the attack attempts, confirming the practical feasibility of the proposed monitoring approach. The framework also offers enhanced system observability, supports regulatory compliance, and enables policy evolution over time.

agent, artificial intelligence, sentinel agent, (15 more...)

2509.14956

Country: North America > United States (0.67)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)