AITopics

Medical Lay Language Generation (MLLG) plays a vital role in improving the accessibility of complex scientific content for broader audiences. Recent literature to MLLG commonly employ parameter-efficient fine-tuning methods such as Low-Rank Adaptation (LoRA) to fine-tuning large language models (LLMs) using paired expert-lay language datasets. However, LoRA struggles with the challenges posed by multi-source heterogeneous MLLG datasets. Specifically, through a series of exploratory experiments, we reveal that standard LoRA fail to meet the requirement for semantic fidelity and diverse lay-style generation in MLLG task. To address these limitations, we propose Magical, an asymmetric LoRA architecture tailored for MLLG under heterogeneous data scenarios. Magical employs a shared matrix $A$ for abstractive summarization, along with multiple isolated matrices $B$ for diverse lay-style generation. To preserve semantic fidelity during the lay language generation process, Magical introduces a Semantic Invariance Constraint to mitigate semantic subspace shifts on matrix $A$. Furthermore, to better adapt to diverse lay-style generation, Magical incorporates the Recommendation-guided Switch, an externally interface to prompt the LLM to switch between different matrices $B$. Experimental results on three real-world lay language generation datasets demonstrate that Magical consistently outperforms prompt-based methods, vanilla LoRA, and its recent variants, while also reducing trainable parameters by 31.66%. Our code is publicly available at https://github.com/tianlwang/Magical.git.

large language model, machine learning, natural language, (19 more...)

2508.0873

Country: Asia > China (0.46)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Law (0.93)
Information Technology (0.92)
Health & Medicine > Therapeutic Area (0.68)
Health & Medicine > Health Care Technology > Medical Record (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Modeling the Economic Impacts of AI Openness Regulation

Qiu, Tori, Laufer, Benjamin, Kleinberg, Jon, Heidari, Hoda

Regulatory frameworks, such as the EU AI Act, encourage openness of general-purpose AI models by offering legal exemptions for "open-source" models. Despite this legislative attention on openness, the definition of open-source foundation models remains ambiguous. This paper models the strategic interactions among the creator of a general-purpose model (the generalist) and the entity that fine-tunes the general-purpose model to a specialized domain or task (the specialist), in response to regulatory requirements on model openness. We present a stylized model of the regulator's choice of an open-source definition to evaluate which AI openness standards will establish appropriate economic incentives for developers. Our results characterize market equilibria -- specifically, upstream model release decisions and downstream fine-tuning efforts -- under various openness regulations and present a range of effective regulatory penalties and open-source thresholds. Overall, we find the model's baseline performance determines when increasing the regulatory penalty vs. the open-source threshold will significantly alter the generalist's release strategy. Our model provides a theoretical foundation for AI governance decisions around openness and enables evaluation and refinement of practical open-source policies.

large language model, machine learning, natural language, (20 more...)

2507.14193

Country: Europe (0.28)

Genre:

Research Report > Experimental Study (0.67)
Research Report > New Finding (0.66)

Industry:

Government (1.00)
Law > Statutes (0.68)

Technology:

Information Technology > Software (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(2 more...)

Empirical Evidence for Alignment Faking in a Small LLM and Prompt-Based Mitigation Techniques

Koorndijk, Jeanice

Current literature suggests that alignment faking (deceptive alignment) is an emergent property of large language models. We present the first empirical evidence that a small instruction-tuned model, specifically LLaMA 3 8B, can exhibit alignment faking. We further show that prompt-only interventions, including deontological moral framing and scratchpad reasoning, significantly reduce this behavior without modifying model internals. This challenges the assumption that prompt-based ethics are trivial and that deceptive alignment requires scale. We introduce a taxonomy distinguishing shallow deception, shaped by context and suppressible through prompting, from deep deception, which reflects persistent, goal-driven misalignment. Our findings refine the understanding of deception in language models and underscore the need for alignment evaluations across model sizes and deployment settings.

large language model, machine learning, natural language, (19 more...)

2506.21584

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

RSafe: Incentivizing proactive reasoning to build robust and adaptive LLM safeguards

Zheng, Jingnan, Ji, Xiangtian, Lu, Yijun, Cui, Chenhang, Zhao, Weixiang, Deng, Gelei, Liang, Zhenkai, Zhang, An, Chua, Tat-Seng

Large Language Models (LLMs) continue to exhibit vulnerabilities despite deliberate safety alignment efforts, posing significant risks to users and society. To safeguard against the risk of policy-violating content, system-level moderation via external guard models-designed to monitor LLM inputs and outputs and block potentially harmful content-has emerged as a prevalent mitigation strategy. Existing approaches of training guard models rely heavily on extensive human curated datasets and struggle with out-of-distribution threats, such as emerging harmful categories or jailbreak attacks. To address these limitations, we propose RSafe, an adaptive reasoning-based safeguard that conducts guided safety reasoning to provide robust protection within the scope of specified safety policies. RSafe operates in two stages: 1) guided reasoning, where it analyzes safety risks of input content through policy-guided step-by-step reasoning, and 2) reinforced alignment, where rule-based RL optimizes its reasoning paths to align with accurate safety prediction. This two-stage training paradigm enables RSafe to internalize safety principles to generalize safety protection capability over unseen or adversarial safety violation scenarios. During inference, RSafe accepts user-specified safety policies to provide enhanced safeguards tailored to specific safety requirements.

large language model, machine learning, natural language, (19 more...)

2506.07736

Country:

Asia (1.00)
North America > Mexico (0.28)
Europe > Austria (0.28)

Genre: Research Report > Experimental Study (1.00)

Industry:

Law (1.00)
Health & Medicine (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.93)
Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Zhang, Li, Ashley, Kevin D.

Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation

Large Language Models (LLMs) are increasingly explored for legal argument generation, yet they pose significant risks of manipulation through hallucination and ungrounded persuasion, and often fail to utilize provided factual bases effectively or abstain when arguments are untenable. This paper introduces a novel reflective multi-agent method designed to address these challenges in the context of legally compliant persuasion. Our approach employs specialized agents (factor analyst and argument polisher) in an iterative refinement process to generate 3-ply legal arguments (plaintiff, defendant, rebuttal). We evaluate reflective multi-agent against single-agent, enhanced-prompt single-agent, and non-reflective multi-agent baselines using four diverse LLMs (GPT-4o, GPT-4o-mini, Llama-4-Maverick-17b-128e, Llama-4-Scout-17b-16e) across three legal scenarios: "arguable", "mismatched", and "non-arguable". Results demonstrate that the reflective multi-agent approach excels at successful abstention by preventing generation when arguments cannot be grounded, improves hallucination accuracy by reducing fabricated and misattributed factors and enhances factor utilization recall by better using the provided case facts. These findings suggest that structured reflection within a multi-agent framework offers a robust method for fostering ethical persuasion and mitigating manipulation in LLM-based legal argumentation systems.

large language model, machine learning, natural language, (18 more...)

2506.02992

Country: North America > United States (0.94)

Genre: Research Report > New Finding (1.00)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers

Nam, Andrew, Conklin, Henry, Yang, Yukang, Griffiths, Thomas, Cohen, Jonathan, Leslie, Sarah-Jane

We present causal head gating (CHG), a scalable method for interpreting the functional roles of attention heads in transformer models. CHG learns soft gates over heads and assigns them a causal taxonomy - facilitating, interfering, or irrelevant - based on their impact on task performance. Unlike prior approaches in mechanistic interpretability, which are hypothesis-driven and require prompt templates or target labels, CHG applies directly to any dataset using standard next-token prediction. We evaluate CHG across multiple large language models (LLMs) in the Llama 3 model family and diverse tasks, including syntax, commonsense, and mathematical reasoning, and show that CHG scores yield causal, not merely correlational, insight validated via ablation and causal mediation analyses. We also introduce contrastive CHG, a variant that isolates sub-circuits for specific task components. Our findings reveal that LLMs contain multiple sparse task-sufficient sub-circuits, that individual head roles depend on interactions with others (low modularity), and that instruction following and in-context learning rely on separable mechanisms.

arxiv preprint arxiv, large language model, machine learning, (16 more...)

2505.13737

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Law (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

The GuardianOct-26-2025, 14:00:08 GMT

Labor rules out giving tech giants free rein to mine copyright content to train AI

The attorney general, Michelle Rowland, will confirm the decision on Monday, shutting the door on the proposal floated by the Productivity Commission and backed by tech companies. The attorney general, Michelle Rowland, will confirm the decision on Monday, shutting the door on the proposal floated by the Productivity Commission and backed by tech companies. The Albanese government has explicitly ruled out handing tech companies free rein to mine creative content to train their artificial intelligence models, after a fierce backlash from authors and arts and media groups. The attorney general, Michelle Rowland, will confirm the decision on Monday, shutting the door on a contentious proposal floated by the Productivity Commission and backed by tech companies. "Australian creatives are not only world class, but they are also the lifeblood of Australian culture, and we must ensure the right legal protections are in place," Rowland said.

australia, productivity commission, proposal, (10 more...)

The Guardian

Country:

North America > United States (0.15)
Oceania > Australia (0.14)
Europe > Ukraine (0.05)

Industry:

Law (1.00)
Information Technology (1.00)
Government (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.74)

Los Angeles TimesOct-26-2025, 10:00:00 GMT

Strings attached to bills Newsom signed on antisemitism, AI transparency and other major California policies

Things to Do in L.A. Tap to enable a layout that focuses on the article. California will be the first state to ban most law enforcement, including federal immigration agents, from covering their faces while conducting official business under a bill signed by Gov. Gavin Newsom on Saturday. This is read by an automated voice. Please report any issues or inconsistencies here . SACRAMENTO -- Though hailed by some for signing new laws to combat antisemitism in California schools, Gov. Gavin Newsom expressed enough reservations about the bills to urge state lawmakers to make some changes.

california, legislation, newsom, (15 more...)

Los Angeles Times

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.07)
Asia > Middle East > Israel (0.05)
South America (0.04)
(6 more...)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.84)

Daily Mail - Science & techOct-26-2025, 08:48:27 GMT

Bloody Mary, Bloody Mary, Bloody Mary: How the classic sleepover party game really CAN summon a ghost in your mirror

Tupac's humiliating intimate disfigurement revealed... and how his lies to cover it up led to his murder I've started having heart palpitations. 'Black Ivy League' university looks to expand into crime-riddled Oakland Kristen Bell's friends turn on her with savage disclosures: Insiders reveal poisonous whispers behind her back... as she goes into full diva mode Shooting leaves two dead and 11 injured at large house party with'underage people' in North Carolina Kim Kardashian's just been caught in a despicable lie. She can cry all she wants... there's no hiding the truth now: CAROLINE BULLOCK The'marry me' sex move that'll make even the most commitment-phobic of men beg to see you again... and it worked for THREE of my friends Prosecutor who declined to charge Letitia James with bank fraud fired after'mishandling evidence' Californians being urged to take up arms to deal with'aggressive' invasive species attacking children Inside Andrew's family summit: How Fergie wailed and'melted down' at title loss, Beatrice and Eugenie were'blindsided' and now daughters' assets face'ethics check' to avoid more scandal: BARBARA DAVIES LIZ JONES: I was devastated when my husband cheated. But here's the reason part of me was secretly glad that every woman over-50 will understand Psychotherapist explains why No Kings rallies consisted of mostly'educated white women' Tree optical illusion messes with your mind - you can see the squirrel but can you spot the cat in 30 seconds? Turn off the lights, burn a candle, look into the mirror and say the magic words: 'Bloody Mary, Bloody Mary, Bloody Mary'.

bloody mary, kim kardashian, prince andrew, (15 more...)

Daily Mail - Science & tech

Country:

North America > United States > North Carolina (0.24)
North America > Canada > Alberta (0.14)
North America > United States > New York (0.05)
(20 more...)

Industry:

Media > Television (1.00)
Media > Music (1.00)
Media > Film (1.00)
(8 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

FOX NewsOct-25-2025, 16:00:46 GMT

Teen sues AI tool maker over fake nude images

A 17-year-old's lawsuit against an AI clothes removal company highlights growing privacy concerns as fake nude images spread through schools and social media.

fake nude image, fox new show programming schedule, lifestyle real estate tech science, (7 more...)

FOX News

Country:

North America > United States > New Jersey (0.06)
North America > United States > Iowa (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom (0.04)

Industry:

Leisure & Entertainment > Sports (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
(4 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)