AITopics | Law

Collaborating Authors

Law

Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader

Vamvas, Jannis, Prat, Ignacio Pérez, Soliva, Not Battesta, Baltermia-Guetg, Sandra, Beeli, Andrina, Beeli, Simona, Capeder, Madlaina, Decurtins, Laura, Gregori, Gian Peder, Hobi, Flavia, Holderegger, Gabriela, Lazzarini, Arina, Lazzarini, Viviana, Rosselli, Walter, Vital, Bettina, Rutkiewicz, Anna, Sennrich, Rico

arXiv.org Artificial IntelligenceSep-25-2025

The Romansh language, spoken in Switzerland, has limited resources for machine translation evaluation. In this paper, we present a benchmark for six varieties of Romansh: Rumantsch Grischun, a supra-regional variety, and five regional varieties: Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader. Our reference translations were created by human translators based on the WMT24++ benchmark, which ensures parallelism with more than 55 other languages. An automatic evaluation of existing MT systems and LLMs shows that translation out of Romansh into German is handled relatively well for all the varieties, but translation into Romansh is still challenging.

large language model, machine learning, translation, (22 more...)

arXiv.org Artificial Intelligence

2509.03148

Country:

Europe > Switzerland (0.34)
Europe > Austria (0.28)
Europe > Germany (0.28)

Genre: Research Report (0.50)

Industry: Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)

Add feedback

The Medium Is Not the Message: Deconfounding Document Embeddings via Linear Concept Erasure

Fan, Yu, Tian, Yang, Ravfogel, Shauli, Sachan, Mrinmaya, Ash, Elliott, Hoyle, Alexander

arXiv.org Artificial IntelligenceSep-25-2025

Embedding-based similarity metrics between text sequences can be influenced not just by the content dimensions we most care about, but can also be biased by spurious attributes like the text's source or language. These document confounders cause problems for many applications, but especially those that need to pool texts from different corpora. This paper shows that a debiasing algorithm that removes information about observed confounders from the encoder representations substantially reduces these biases at a minimal computational cost. Document similarity and clustering metrics improve across every embedding variant and task we evaluate -- often dramatically. Interestingly, performance on out-of-distribution benchmarks is not impacted, indicating that the embeddings are not otherwise degraded.

computational linguistic, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2507.01234

Country:

Europe (1.00)
Asia (1.00)
North America > United States > Minnesota (0.28)

Genre: Research Report > New Finding (0.93)

Industry:

Law > Government & the Courts (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

Identities are not Interchangeable: The Problem of Overgeneralization in Fair Machine Learning

Wang, Angelina

arXiv.org Artificial IntelligenceSep-25-2025

A key value proposition of machine learning is generalizability: the same methods and model architecture should be able to work across different domains and different contexts. While powerful, this generalization can sometimes go too far, and miss the importance of the specifics. In this work, we look at how fair machine learning has often treated as interchangeable the identity axis along which discrimination occurs. In other words, racism is measured and mitigated the same way as sexism, as ableism, as ageism. Disciplines outside of computer science have pointed out both the similarities and differences between these different forms of oppression, and in this work we draw out the implications for fair machine learning. While certainly not all aspects of fair machine learning need to be tailored to the specific form of oppression, there is a pressing need for greater attention to such specificity than is currently evident. Ultimately, context specificity can deepen our understanding of how to build more fair systems, widen our scope to include currently overlooked harms, and, almost paradoxically, also help to narrow our scope and counter the fear of an infinite number of group-specific methods of analysis.

artificial intelligence, discrimination, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3715275.3732033

2505.04038

Country: North America > United States (1.00)

Genre: Research Report (0.40)

Industry:

Law > Labor & Employment Law (1.00)
Law > Civil Rights & Constitutional Law (1.00)
Information Technology (0.93)
(3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

How retirees can stop fake debt collector scams

FOX NewsSep-24-2025, 14:44:27 GMT

Retirees face growing threats from scammers posing as debt collectors who demand payment through gift cards and refuse to provide written verification of debts.

lifestyle real estate tech science, personal information, retiree, (6 more...)

FOX News

Country:

North America > United States > Montana (0.04)
North America > United States > California (0.04)
Europe > Germany (0.04)

Industry:

Media (1.00)
Leisure & Entertainment > Sports (1.00)
Law (1.00)
(5 more...)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.98)

Add feedback

0764db1151b936aca59249e2c1386101-Paper-Conference.pdf

Neural Information Processing SystemsSep-24-2025, 11:38:09 GMT

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County (0.67)

Genre: Personal > Interview (0.45)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.96)

Add feedback

Were you convinced by the Rapture? You're probably ARROGANT: People who believe in conspiracy theories are 'massively overconfident', study finds

Daily Mail - Science & techSep-24-2025, 11:24:03 GMT

Ben Affleck and Jennifer Garner's daughter Violet emotionally advocates for mask mandates and children with long COVID at United Nations event Jimmy Kimmel weeps while saying he'never intended' to'make light of' Charlie Kirk's death - but DOESN'T apologize as he hits out at Trump If Trump isn't careful, he will end up no better than Biden! This dirty revenge tour must cease... before everyone loses: DAN MCLAUGHLIN Jimmy Kimmel's comeback descends into chaos: Staff turn on host over'sh***y' behavior... as'betrayal rumor' runs rife backstage Charlie Kirk suspect's trans lover has VANISHED: Shaken neighbors share fresh fears... as new photos show abandoned home Jimmy Kimmel's return BLASTED by Roseanne Barr seven years after ABC fired her: 'Double standard' I'm the doctor on the cusp of an autism breakthrough... we're using an everyday $2.50 pill to reverse children's symptoms Dancing with the Stars drama explodes: Cast are'miserable'... concerned family say smiles on screen are FAKE... and producers are forced to issue'warning' The world's best burgers REVEALED - and London bags nearly half of the top ten spots (but number one will surprise you) I was a devout Catholic... until I died. Moment daughter of Trump's would-be assassin Ryan Routh LOSES IT outside of court after father convicted of trying to kill president Sarah Ferguson claims she was trying to protect Princesses Beatrice and Eugenie when she sent apology email to Jeffrey Epstein'as her children come first' The View co-host makes cheeky immigration crack about Kamala Harris' Miami book tour stop SARAH VINE: The striking similarities between Sarah Ferguson and Meghan... and why Fergie's downfall should be a red flag for the Sussexes Chappell Roan'accidentally' reveals derrière onstage: 'I forgot my bottom was just a thong' Kim Kardashian takes a pop at Kanye as she poses topless for Vogue: 'I gained confidence three years ago... before, I always needed to check with someone before making decisions' Were you convinced by the Rapture? You're probably ARROGANT: People who believe in conspiracy theories are'massively overconfident', study finds READ MORE: Devout Christians take drastic action as'The Rapture' approaches Thousands of people around the world woke up yesterday morning hoping it would be their last day on Earth. The'Rapture' was a theory put forward by a South African pastor, claiming that Jesus would return to Earth on September 23, causing his followers to rise into the sky to meet him.

conspiracy theory, jimmy kimmel, rapture, (12 more...)

Daily Mail - Science & tech

Country:

North America > Canada > Alberta (0.14)
North America > United States > Texas (0.04)
North America > United States > New York (0.04)
(17 more...)

Genre: Research Report (0.93)

Industry:

Media > Television (1.00)
Media > Film (1.00)
Leisure & Entertainment (1.00)
(3 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Mobile (0.69)
Information Technology > Artificial Intelligence (0.68)

Add feedback

0a9747136d411fb83f0cf81820d44afb-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsSep-24-2025, 10:07:05 GMT

This problem is called the "Riemann problem", and the initial discontinuity

artificial intelligence, equation, machine learning, (18 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Genre: Research Report (0.46)

Industry:

Law (1.00)
Energy > Oil & Gas > Upstream (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

LogicGuard: Improving Embodied LLM agents through Temporal Logic based Critics

Gokhale, Anand, Srivastava, Vaibhav, Bullo, Francesco

arXiv.org Artificial IntelligenceSep-24-2025

Large language models (LLMs) have shown promise in zero-shot and single step reasoning and decision making problems, but in long horizon sequential planning tasks, their errors compound, often leading to unreliable or inefficient behavior. We introduce LogicGuard, a modular actor-critic architecture in which an LLM actor is guided by a trajectory level LLM critic that communicates through Linear Temporal Logic (LTL). Our setup combines the reasoning strengths of language models with the guarantees of formal logic. The actor selects high-level actions from natural language observations, while the critic analyzes full trajectories and proposes new LTL constraints that shield the actor from future unsafe or inefficient behavior. LogicGuard supports both fixed safety rules and adaptive, learned constraints, and is model-agnostic: any LLM-based planner can serve as the actor, with LogicGuard acting as a logic-generating wrapper. We formalize planning as graph traversal under symbolic constraints, allowing LogicGuard to analyze failed or suboptimal trajectories and generate new temporal logic rules that improve future behavior. To demonstrate generality, we evaluate LogicGuard across two distinct settings: short-horizon general tasks and long-horizon specialist tasks. On the Behavior benchmark of 100 household tasks, LogicGuard increases task completion rates by 25% over a baseline InnerMonologue planner. On the Minecraft diamond-mining task, which is long-horizon and requires multiple interdependent subgoals, LogicGuard improves both efficiency and safety compared to SayCan and InnerMonologue. These results show that enabling LLMs to supervise each other through temporal logic yields more reliable, efficient and safe decision-making for both embodied agents.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2507.03293

Country: Europe (0.28)

Genre: Research Report > New Finding (0.48)

Industry:

Law (0.93)
Materials > Metals & Mining > Diamonds (0.66)
Materials > Metals & Mining > Iron (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study

Lee, DongGeon, Jang, Joonwon, Jeong, Jihae, Yu, Hwanjo

arXiv.org Artificial IntelligenceSep-24-2025

Rapid deployment of vision-language models (VLMs) magnifies safety risks, yet most evaluations rely on artificial images. This study asks: How safe are current VLMs when confronted with meme images that ordinary users share? To investigate this question, we introduce MemeSafetyBench, a 50,430-instance benchmark pairing real meme images with both harmful and benign instructions. Using a comprehensive safety taxonomy and LLM-based instruction generation, we assess multiple VLMs across single and multi-turn interactions. We investigate how real-world memes influence harmful outputs, the mitigating effects of conversational context, and the relationship between model scale and safety metrics. Our findings demonstrate that VLMs are more vulnerable to meme-based harmful prompts than to synthetic or typographic images. Memes significantly increase harmful responses and decrease refusals compared to text-only inputs. Though multi-turn interactions provide partial mitigation, elevated vulnerability persists. These results highlight the need for ecologically valid evaluations and stronger safety mechanisms. MemeSafetyBench is publicly available at https://github.com/oneonlee/Meme-Safety-Bench.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2505.15389

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP

Sovrano, Francesco

arXiv.org Artificial IntelligenceSep-24-2025

Large language models (LLMs) can amplify misinformation, undermining societal goals like the UN SDGs. We study three documented drivers of misinformation (valence framing, information overload, and oversimplification) which are often shaped by one's default beliefs. Building on evidence that LLMs encode such defaults (e.g., "joy is positive," "math is complex") and can act as "bags of heuristics," we ask: can general belief-driven heuristics behind misinformative behaviour be recovered from LLMs as clear rules? A key obstacle is that global rule-extraction methods in explainable AI (XAI) are built for numerical inputs/outputs, not text. We address this by eliciting global LLM beliefs and mapping them to numerical scores via statistically reliable abstractions, thereby enabling off-the-shelf global XAI to detect belief-related heuristics in LLMs. To obtain ground truth, we hard-code bias-inducing nonlinear heuristics of increasing complexity (univariate, conjunctive, nonconvex) into popular LLMs (ChatGPT and Llama) via system instructions. This way, we find that RuleFit under-detects non-univariate biases, while global SHAP better approximates conjunctive ones but does not yield actionable rules. To bridge this gap, we propose RuleSHAP, a rule-extraction algorithm that couples global SHAP-value aggregations with rule induction to better capture non-univariate bias, improving heuristics detection over RuleFit by +94% (MRR@1) on average. Our results provide a practical pathway for revealing belief-driven biases in LLMs.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2505.11189

Country:

North America > United States (1.00)
Europe (0.92)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)

Add feedback