AITopics | Law

The Surprising Effectiveness of Membership Inference with Simple N-Gram Coverage

Hallinan, Skyler, Jung, Jaehun, Sclar, Melanie, Lu, Ximing, Ravichander, Abhilasha, Ramnath, Sahana, Choi, Yejin, Karimireddy, Sai Praneeth, Mireshghallah, Niloofar, Ren, Xiang

arXiv.org Artificial Intelligence, Aug-14-2025

Membership inference attacks serve as a useful tool for fair use of language models, such as detecting potential copyright infringement and auditing data leakage. However, many current state-of-the-art attacks require access to models' hidden states or probability distributions, which prevents investigation of more widely used, API-access-only models like GPT-4. In this work, we introduce N-Gram Coverage Attack, a membership inference attack that relies solely on text outputs from the target model, enabling attacks on completely black-box models. We leverage the observation that models are more likely to memorize and subsequently generate text patterns that were commonly observed in their training data. Specifically, to make a prediction on a candidate member, N-Gram Coverage Attack first obtains multiple model generations conditioned on a prefix of the candidate. It then uses n-gram overlap metrics to compute and aggregate the similarities of these outputs with the ground truth suffix; high similarities indicate likely membership. We first demonstrate on a diverse set of existing benchmarks that N-Gram Coverage Attack outperforms other black-box methods while also impressively achieving performance comparable to, or even better than, state-of-the-art white-box attacks, despite having access to only text outputs. Interestingly, we find that the success rate of our method scales with the attack compute budget: as we increase the number of sequences generated from the target model conditioned on the prefix, attack performance tends to improve. Having verified the accuracy of our method, we use it to investigate previously unstudied closed OpenAI models on multiple domains. We find that more recent models, such as GPT-4o, exhibit increased robustness to membership inference, suggesting an evolving trend toward improved privacy protections.
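The procedure described in this abstract (sample several continuations of a prefix, score each against the true suffix by n-gram overlap, then aggregate) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the whitespace tokenizer, the specific coverage formula, the 50% prefix split, the `max` aggregation, and the `generate` callback (a stand-in for any text-only model API) are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence, Set, Tuple


def ngrams(tokens: Sequence[str], n: int) -> Set[Tuple[str, ...]]:
    """Return the set of n-grams in a token sequence (empty if too short)."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def coverage(generation: str, suffix: str, n: int = 3) -> float:
    """Fraction of the suffix's n-grams that also occur in the generation.

    Assumption: whitespace tokenization and this particular overlap ratio
    are illustrative choices, not the paper's exact metric.
    """
    reference = ngrams(suffix.split(), n)
    if not reference:
        return 0.0
    produced = ngrams(generation.split(), n)
    return len(reference & produced) / len(reference)


def membership_score(
    candidate: str,
    generate: Callable[[str, int], List[str]],  # hypothetical model wrapper
    num_samples: int = 8,
    prefix_fraction: float = 0.5,  # assumed split point
    n: int = 3,
    aggregate: Callable[[List[float]], float] = max,  # assumed aggregation
) -> float:
    """Score a candidate; higher values suggest likelier membership."""
    words = candidate.split()
    split = max(1, int(len(words) * prefix_fraction))
    prefix = " ".join(words[:split])
    suffix = " ".join(words[split:])
    # Only text outputs are used, matching the black-box setting.
    generations = generate(prefix, num_samples)
    scores = [coverage(text, suffix, n) for text in generations]
    return aggregate(scores) if scores else 0.0


if __name__ == "__main__":
    # Stub generator standing in for a real model API (hypothetical).
    def stub_generate(prefix: str, k: int) -> List[str]:
        return ["over the lazy dog today"] * k

    text = "the quick brown fox jumps over the lazy dog again"
    print(membership_score(text, stub_generate, num_samples=4))
```

In practice the score would be compared against a threshold calibrated on known members and non-members; raising `num_samples` corresponds to the larger attack compute budget mentioned in the abstract.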

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2508.09603

Country: Asia (0.28)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Law > Intellectual Property & Technology Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Collective dynamics of strategic classification

Couto, Marta C., Barsotti, Flavia, Santos, Fernando P.

arXiv.org Artificial Intelligence, Aug-14-2025

Classification algorithms based on Artificial Intelligence (AI) are now applied in high-stakes decisions in finance, healthcare, criminal justice, and education. Individuals can strategically adapt to the information gathered about classifiers, which in turn may require algorithms to be re-trained. Which collective dynamics will result from users' adaptation and algorithms' retraining? We apply evolutionary game theory to address this question. Our framework provides a mathematically rigorous way of treating the problem of feedback loops between collectives of users and institutions, making it possible to test interventions that mitigate the adverse effects of strategic adaptation. As a case study, we consider institutions deploying algorithms for credit lending. We consider several scenarios, each representing different interaction paradigms. When algorithms are not robust against strategic manipulation, we are able to capture previous challenges discussed in the strategic classification literature, whereby users either pay excessive costs to meet the institutions' expectations (leading to high social costs) or game the algorithm (e.g., provide fake information). From this baseline setting, we test the role of improving gaming detection and providing algorithmic recourse. We show that increased detection capabilities reduce social costs and could lead to users' improvement; when perfect classifiers are not feasible (likely to occur in practice), algorithmic recourse can steer the dynamics towards high rates of user improvement. The speed at which the institutions re-adapt to the user population plays a role in the final outcome. Finally, we explore a scenario where strict institutions provide actionable recourse to their unsuccessful users and observe cycling dynamics so far unnoticed in the literature.

artificial intelligence, machine learning, scenario, (19 more...)

arXiv.org Artificial Intelligence

2508.0934

Country:

Europe (1.00)
North America > United States (0.68)

Genre: Research Report (1.00)

Industry:

Banking & Finance (1.00)
Law (0.87)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens

Choi, Anna Seo Gyeong, Choi, Hoon

arXiv.org Artificial Intelligence, Aug-14-2025

Automatic Speech Recognition (ASR) systems now mediate countless human-technology interactions, yet research on their fairness implications remains surprisingly limited. This paper examines ASR bias through a philosophical lens, arguing that systematic misrecognition of certain speech varieties constitutes more than a technical limitation -- it represents a form of disrespect that compounds historical injustices against marginalized linguistic communities. We distinguish between morally neutral classification (discriminate1) and harmful discrimination (discriminate2), demonstrating how ASR systems can inadvertently transform the former into the latter when they consistently misrecognize non-standard dialects. We identify three unique ethical dimensions of speech technologies that differentiate ASR bias from other algorithmic fairness concerns: the temporal burden placed on speakers of non-standard varieties ("temporal taxation"), the disruption of conversational flow when systems misrecognize speech, and the fundamental connection between speech patterns and personal/cultural identity. These factors create asymmetric power relationships that existing technical fairness metrics fail to capture. The paper analyzes the tension between linguistic standardization and pluralism in ASR development, arguing that current approaches often embed and reinforce problematic language ideologies. We conclude that addressing ASR bias requires more than technical interventions; it demands recognition of diverse speech varieties as legitimate forms of expression worthy of technological accommodation. This philosophical reframing offers new pathways for developing ASR systems that respect linguistic diversity and speaker autonomy.

artificial intelligence, discrimination, speech recognition, (12 more...)

arXiv.org Artificial Intelligence

2508.07143

Country: North America > United States (1.00)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Health & Medicine > Therapeutic Area (0.93)

Technology: Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

Zhiqing Sun

Neural Information Processing Systems, Aug-13-2025, 20:53:40 GMT

Principle Engraving: In the third stage, we fine-tune the original LLM (the base model) on the self-aligned responses, generated by the LLM itself through prompting, while pruning the principles and demonstrations for the fine-tuned model.

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County (0.67)

Genre: Personal > Interview (0.45)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.96)

0a9747136d411fb83f0cf81820d44afb-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems, Aug-13-2025, 18:23:57 GMT

This problem is called the "Riemann problem", and the initial discontinuity

artificial intelligence, equation, machine learning, (18 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Genre: Research Report (0.46)

Industry:

Law (1.00)
Energy > Oil & Gas > Upstream (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Elon Musk and Sam Altman's AI Feud Gets Nasty

TIME - Tech, Aug-13-2025, 16:52:01 GMT

A long-running feud between Elon Musk and Sam Altman spilled out into the open this week as the AI billionaire heavyweights publicly fought over their rival companies. The latest round in the battle between the X CEO and the CEO of OpenAI began when Musk claimed that Apple had been favoring Altman's AI app over his own in the Apple Store rankings. "Apple is behaving in a manner that makes it impossible for any AI company besides OpenAI to reach #1 in the App Store, which is an unequivocal antitrust violation," Musk said on X on Monday evening. "xAI will take immediate legal action," he added, referring to the AI company he leads. "Hey @Apple App Store, why do you refuse to put either X or Grok in your 'Must Have' section when X is the #1 news app in the world and Grok is #5 among all apps?" he asked.

altman, elon musk and sam altman, musk, (7 more...)

TIME - Tech

Country: Africa > South Africa (0.06)

Industry:

Law (0.57)
Information Technology (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.72)

DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks

Neural Information Processing Systems, Aug-13-2025, 16:18:44 GMT

With more than 39 TB of publicly available engineering data, DrivAerNet++ fills a significant gap in available resources, providing high-quality, diverse data to enhance model training, promote generalization, and accelerate automotive design processes.

artificial intelligence, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > Germany (0.28)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Europe > Netherlands (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Law (0.92)
(3 more...)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Concerning the Responsible Use of AI in the U.S. Criminal Justice System

Communications of the ACM, Aug-13-2025, 13:46:59 GMT

Artificial intelligence (AI) is advancing quickly and is being adopted in most industries. Using AI to draft an email message or check your grammar is typically not a cause for concern, but using it to make decisions that affect people's lives is another matter. When constitutional rights are involved, as in the justice system, transparency is paramount. During the Biden-Harris administration, Executive Order 14110 directed agencies to develop guidelines for acceptable uses and regulation of AI. Some of these uses, like summarizing and notetaking, will occur across the government.

artificial intelligence, justice system, machine learning, (16 more...)

Communications of the ACM

Country: North America > United States > California (0.04)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.76)
Information Technology > Artificial Intelligence > Applied AI (0.72)

Live facial recognition is 'worrying for our democracy', experts warn as the government expands the 'Orwellian' system across Britain

Daily Mail - Science & tech, Aug-13-2025, 11:17:38 GMT

Experts have warned of a 'frightening expansion' of 'Orwellian' technology as the government expands the use of live facial recognition across the country. Ten vans equipped with facial recognition cameras will be deployed across seven police forces: Greater Manchester, West Yorkshire, Bedfordshire, Surrey, Sussex, Thames Valley and Hampshire. The Home Office maintains that this technology will only be used to catch 'high-harm' offenders with rules to ensure 'safeguards and oversight'. According to the government, the technology has already been used to make 580 arrests in London over the last year, including 52 registered sex offenders. However, rights groups have raised concerns that the unprecedented rollout of this surveillance technology risks becoming overly intrusive.

facial recognition, government, recognition, (13 more...)

Daily Mail - Science & tech

Country:

Europe > United Kingdom > England > West Yorkshire (0.26)
Europe > United Kingdom > Wales (0.07)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
Government > Regional Government > Europe Government > United Kingdom Government (0.40)

Technology: Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)

YouTube to Start Using AI to Estimate Users' Ages. Here's What to Know

TIME - Tech, Aug-13-2025, 09:00:00 GMT

YouTube is one of the most popular online platforms in the U.S. among all age groups. But not all content on the video-sharing site is appropriate for all ages. While the platform, like most, restricts certain content, such as violence and nudity, for users under 18, these safeguards have in the past been easy for young users to circumvent by entering an older birthdate on their account. But now, the company is rolling out an artificial intelligence-powered tool to estimate a user's age based on their activity on the platform "and then use that signal, regardless of the birthday in the account, to deliver our age-appropriate product experiences and protection," said James Beser, director of product management at YouTube Youth, in a blog post last month. The technology, according to Beser, has been used in other markets "for some time" and will begin being tested in the U.S. on Wednesday before a wider rollout.

platform, protection, youtube, (12 more...)

TIME - Tech

Country:

Oceania > Australia (0.05)
North America > United States > Texas (0.05)
Europe > United Kingdom (0.05)

Industry:

Law (0.99)
Government (0.73)
Information Technology > Security & Privacy (0.66)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)
