AITopics

2509.01909

Country: Europe (0.45)

Genre: Research Report > New Finding (0.45)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

BBC NewsOct-14-2025, 23:19:53 GMT

ChatGPT will soon allow erotica for verified adults, says OpenAI boss

OpenAI plans to allow a wider range of content, including erotica, on its popular chatbot ChatGPT as part of its push to treat adult users like adults, says its boss Sam Altman. In a post on X on Tuesday, Mr Altman said upcoming versions of the popular chatbot would enable it to behave in a more human-like way - but only if you want it, not because we are usage maxxing. The move, reminiscent of Elon Musk's xAI recent introduction of two sexually explicit chatbots to Grok, could help OpenAI attract more paying subscribers. It is also likely to intensify pressure on lawmakers to introduce tighter restrictions on chatbot companions. OpenAI did not respond to the BBC's requests for comment following Mr Altman's post.

chatgpt, erotica, openai, (15 more...)

BBC News

Country:

South America (0.15)
North America > United States > California (0.15)
North America > Central America (0.15)
(13 more...)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

FOX NewsOct-14-2025, 11:41:22 GMT

Federal judge fines, reprimands lawyer who used AI to draft court filings

U.S. District Judge Terry Moorer fined Alabama lawyer James Johnson $5,000 for using artificial intelligence to create fake case citations in court filings.

fox new show programming schedule, lifestyle real estate tech science, tech and electronic deal health, (7 more...)

FOX News

Country:

North America > United States > Alabama (0.26)
North America > United States > Tennessee (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.05)
(2 more...)

Industry:

Leisure & Entertainment > Sports (1.00)
Law (1.00)
Government > Regional Government > North America Government > United States Government (0.69)

Technology:

Information Technology > Artificial Intelligence > Applied AI (1.00)
Information Technology > Communications > Social Media (0.76)

Daily Mail - Science & techOct-14-2025, 10:50:48 GMT

Spot the difference: Apple has rebranded its TV service as part of a 'vibrant new identity' - so, can you see what has changed?

Hamas executes'collaborators' in Gaza as it clings to power amid fears Trump's peace deal is already at risk Internet star who demanded free seats for fat fliers vanished without trace... now the Daily Mail has learned the heartbreaking reason why Donald Trump tells crowds there are world leaders he'doesn't like at ALL' as he teases who they are How Diane Keaton's closest friend helped her to achieve her'lifelong ambition' just months before she died - and the poignant legacy it leaves Kate and Wills' fresh start at their'forever home': Why they have fast-tracked their move to house they will never leave - even when he becomes King'It's Meghan Markle 3.0': Why the duchess has set tongues wagging that she's plotting another Sussex relaunch'as she holds cosy meeting with new editor of US Vogue' Trump's ominous warning to Macron at Egypt summit: 'You will see what is about to happen' Neil Diamond, 84, sang Sweet Caroline and worked with Cher as well as Barbra Streisand... see him now Insiders reveal how reluctant Katy Perry finally gave in to'persistent' Justin Trudeau... as sexy yacht photos get spicy response from his ex-wife Awkward moment Donald Trump asks Giorgia Meloni'You won't be offended if I say you're beautiful, right? Horrors endured by Israel's last 20 hostages: Chained, tortured, and starved. Lindsey Halligan removes senior DOJ official after taking over Virginia US attorney's office Gorgeous Bay Area enclave filled with hippies becomes America's ANGRIEST town over plans for huge affordable housing project MLB fans hail'greatest play in baseball HISTORY' after Dodgers thought they hit grand slam in Brewers game Father launches campaign to become sheriff as he faces murder trial for killing teenage daughter's abuser Spot the difference: Apple has rebranded its TV service as part of a'vibrant new identity' - so, can you see what has changed? But Apple TV+ is no more - as Apple has quietly rebranded its streaming service. 'Apple TV+ is now simply Apple TV, with a vibrant new identity,' the tech giant explained in the bottom of a press release on the streaming debut of its film, 'F1 The Movie'.

apple, diane keaton, vibrant new identity, (13 more...)

Daily Mail - Science & tech

Country:

North America > United States > Virginia (0.24)
Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.24)
Asia > Middle East > Israel (0.24)
(19 more...)

Genre: Personal (1.00)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Law (1.00)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Mobile (0.69)

MIT Technology ReviewOct-14-2025, 10:00:00 GMT

Can we repair the internet?

Can we repair the internet? Three new books propose remedies that run the gamut from government regulation to user responsibility. From addictive algorithms to exploitative apps, data mining to misinformation, the internet today can be a hazardous place. Books by three influential figures--the intellect behind "net neutrality," a former Meta executive, and the web's own inventor--propose radical approaches to fixing it. But are these luminaries the right people for the job? Though each shows conviction, and even sometimes inventiveness, the solutions they present reveal blind spots.

big tech, clegg, internet, (16 more...)

MIT Technology Review

Country:

Europe > United Kingdom (0.14)
Oceania > Australia (0.04)
North America > United States > Massachusetts (0.04)
(2 more...)

Genre: Summary/Review (0.70)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Information Technology > Services (0.95)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.30)

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

Yan, Hanqi, Xu, Hainiu, Qi, Siya, Yang, Shu, He, Yulan

With the growing accessibility and wide adoption of large language models, concerns about their safety and alignment with human values have become paramount. In this paper, we identify a concerning phenomenon: Reasoning-Induced Misalignment (RIM), in which misalignment emerges when reasoning capabilities strengthened-particularly when specific types of reasoning patterns are introduced during inference or training. Beyond reporting this vulnerability, we provide the first mechanistic account of its origins. Through representation analysis, we discover that specific attention heads facilitate refusal by reducing their attention to CoT tokens, a mechanism that modulates the model's rationalization process during inference. During training, we find significantly higher activation entanglement between reasoning and safety in safety-critical neurons than in control neurons, particularly after fine-tuning with those identified reasoning patterns. This entanglement strongly correlates with catastrophic forgetting, providing a neuron-level explanation for RIM.

large language model, machine learning, natural language, (18 more...)

2509.00544

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)

Genre: Research Report (1.00)

Industry:

Education (0.67)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
Law > Criminal Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.88)

Smith, Henry D., Diamant, Nathaniel L., Trippe, Brian L.

Calibrating Generative Models

arXiv.org Machine LearningOct-14-2025

Generative models frequently suffer miscalibration, wherein class probabilities and other statistics of the sampling distribution deviate from desired values. We frame calibration as a constrained optimization problem and seek the closest model in Kullback-Leibler divergence satisfying calibration constraints. To address the intractability of imposing these constraints exactly, we introduce two surrogate objectives for fine-tuning: (1) the relax loss, which replaces the constraint with a miscalibration penalty, and (2) the reward loss, which converts calibration into a reward fine-tuning problem. We demonstrate that these approaches substantially reduce calibration error across hundreds of simultaneous constraints and models with up to one billion parameters, spanning applications in protein design, image generation, and language modeling.

constraint, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

2510.1002

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.63)

Industry:

Law (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Revisiting Trust in the Era of Generative AI: Factorial Structure and Latent Profiles

Sun, Haocan, Liu, Weizi, Wu, Di, Yu, Guoming, Yao, Mike

Trust is one of the most important factors shaping whether and how people adopt and rely on artificial intelligence (AI). Yet most existing studies measure trust in terms of functionality, focusing on whether a system is reliable, accurate, or easy to use, while giving less attention to the social and emotional dimensions that are increasingly relevant for today's generative AI (GenAI) systems. These systems do not just process information; they converse, respond, and collaborate with users, blurring the line between tool and partner. In this study, we introduce and validate the Human-AI Trust Scale (HAITS), a new measure designed to capture both the rational and relational aspects of trust in GenAI. Drawing on prior trust theories, qualitative interviews, and two waves of large-scale surveys in China and the United States, we used exploratory (n = 1,546) and confirmatory (n = 1,426) factor analyses to identify four key dimensions of trust: Affective Trust, Competence Trust, Benevolence & Integrity, and Perceived Risk. We then applied latent profile analysis to classify users into six distinct trust profiles, revealing meaningful differences in how affective-competence trust and trust-distrust frameworks coexist across individuals and cultures. Our findings offer a validated, culturally sensitive tool for measuring trust in GenAI and provide new insight into how trust evolves in human-AI interaction. By integrating instrumental and relational perspectives of trust, this work lays the foundation for more nuanced research and design of trustworthy AI systems.

artificial intelligence, machine learning, natural language, (20 more...)

2510.10199

Country: North America > United States > Illinois (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Education (0.67)
Information Technology > Security & Privacy (0.67)
Law > Statutes (0.46)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
(2 more...)

ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

Gui, Xin, Zhu, King, Ren, JinCheng, Chen, Qianben, Wang, Zekun Moore, LI, Yizhi, Liu, Xinpeng, Li, Xiaowan, Ren, Wenli, Miao, Linyu, Qin, Tianrui, Shu, Ziqi, Zhu, He, Tang, Xiangru, Shi, Dingfeng, Liu, Jiaheng, Jiang, Yuchen Eleanor, Liu, Minghao, Zhang, Ge, Zhou, Wangchunshu

In recent years, the research focus of large language models (LLMs) and agents has shifted increasingly from demonstrating novel capabilities to complex reasoning and tackling challenging tasks. However, existing evaluations focus mainly on math/code contests or general tasks, while existing multi-domain academic benchmarks lack sufficient reasoning depth, leaving the field without a rigorous benchmark for high-level reasoning. To fill this gap, we introduce the Acadreason benchmark, designed to evaluate the ability of LLMs and agents to acquire and reason over academic knowledge. It consists of 50 expert-annotated academic problems across five high-reasoning domains, including computer science, economics, law, mathematics, and philosophy. All questions are sourced from top-tier publications in recent years and undergo rigorous annotation and quality control to ensure they are both challenging and answerable. We conduct systematic evaluations of over 10 mainstream LLMs and agents. The results show that most LLMs scored below 20 points, with even the cutting-edge GPT-5 achieving only 16 points. While agents achieved higher scores, none exceeded 40 points. This demonstrates the current capability gap between LLMs and agents in super-intelligent academic research tasks and highlights the challenges of Acadreason.

benchmark, large language model, machine learning, (18 more...)

2510.11652

Genre: Research Report > New Finding (0.87)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

High-Power Training Data Identification with Provable Statistical Guarantees

Liu, Zhenlong, Zeng, Hao, Huang, Weiran, Wei, Hongxin

The conventional approaches treat it as a simple binary classification task without statistical guarantees. A recent approach is designed to control the false discovery rate (FDR), but its guarantees rely on strong, easily violated assumptions. In this paper, we introduce Provable Training Data Identification (PTDI), a rigorous method that identifies a set of training data with strict false discovery rate (FDR) control. Specifically, our method computes p-values for each data point using a set of known unseen data, and then constructs a conservative estimator for the data usage proportion of the test set, which allows us to scale these p-values. Our approach then selects the final set of training data by identifying all points whose scaled p-values fall below a data-dependent threshold. This entire procedure enables the discovery of training data with provable, strict FDR control and significantly boosted power. Extensive experiments across a wide range of models (LLMs and VLMs), and datasets demonstrate that PTDI strictly controls the FDR and achieves higher power. These concerns raise the importance of identifying a specific, well-defined set of data allegedly used in training. To resolve such high-stakes disputes, claims must be supported by credible evidence that strictly controls the risk of false positives. This underscores the need for methods that provide rigorous statistical guarantees for identifying training data.

large language model, machine learning, natural language, (20 more...)

2510.09717

Country:

North America > United States (1.00)
Europe (0.93)
Asia (0.93)

Genre: Research Report > Experimental Study (0.77)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Government > Regional Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)