AITopics

2505.14945

Country: North America > United States > California (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Issak, Alayt, Narayan, Uttkarsh, Srinivasan, Ramya, Kleinman, Erica, Harteveld, Casper

Kaleidoscope Gallery: Exploring Ethics and Generative AI Through Art

arXiv.org Artificial IntelligenceMay-22-2025

Ethical theories and Generative AI (GenAI) models are dynamic concepts subject to continuous evolution. This paper investigates the visualization of ethics through a subset of GenAI models. We expand on the emerging field of Visual Ethics, using art as a form of critical inquiry and the metaphor of a kaleidoscope to invoke moral imagination. Through formative interviews with 10 ethics experts, we first establish a foundation of ethical theories. Our analysis reveals five families of ethical theories, which we then transform into images using the text-to-image (T2I) GenAI model. The resulting imagery, curated as Kaleidoscope Gallery and evaluated by the same experts, revealed eight themes that highlight how morality, society, and learned associations are central to ethical theories. We discuss implications for critically examining T2I models and present cautions and considerations. This work contributes to examining ethical theories as foundational knowledge that interrogates GenAI models as socio-technical systems.

artificial intelligence, machine learning, natural language, (17 more...)

2505.14758

Country:

North America > United States (1.00)
Europe > United Kingdom > England (0.28)

Genre:

Questionnaire & Opinion Survey (1.00)
Personal > Interview (0.93)
Research Report > New Finding (0.68)

Industry: Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

arXiv.org Artificial IntelligenceMay-22-2025

The Evolution of Alpha in Finance Harnessing Human Insight and LLM Agents

Islam, Mohammad Rubyet

The pursuit of alpha returns that exceed market benchmarks has undergone a profound transformation, evolving from intuition-driven investing to autonomous, AI powered systems. This paper introduces a comprehensive five stage taxonomy that traces this progression across manual strategies, statistical models, classical machine learning, deep learning, and agentic architectures powered by large language models (LLMs). Unlike prior surveys focused narrowly on modeling techniques, this review adopts a system level lens, integrating advances in representation learning, multimodal data fusion, and tool augmented LLM agents. The strategic shift from static predictors to contextaware financial agents capable of real time reasoning, scenario simulation, and cross modal decision making is emphasized. Key challenges in interpretability, data fragility, governance, and regulatory compliance areas critical to production deployment are examined. The proposed taxonomy offers a unified framework for evaluating maturity, aligning infrastructure, and guiding the responsible development of next generation alpha systems.

large language model, machine learning, natural language, (17 more...)

2505.14727

Country: North America > United States > Maryland (0.28)

Genre:

Overview (0.48)
Research Report (0.40)

Industry:

Law (1.00)
Banking & Finance > Trading (1.00)
Banking & Finance > Economy (0.68)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

SlateMay-21-2025, 19:30:00 GMT

Cannes Is Rolling Out the Red Carpet for One of This Century's Most Controversial Figures

Although the Cannes Film Festival is the world's most prestigious movie showcase, its spotlight rarely falls on nonfiction film. Years go by without a single documentary competing for its biggest honor, the Palme d'Or, and there is no separate documentary prize. Juliette Binoche, the president of this year's jury, devoted part of her opening-night remarks to Fatma Hassona, the Palestinian photojournalist who was killed in an Israeli airstrike the day after it was announced that her documentary Put Your Soul on Your Hand and Walk would be premiering at Cannes. But the film itself was slotted into a low-profile sidebar devoted to independent productions. The festival did, however, roll out the red carpet for The Six Billion Dollar Man, Eugene Jarecki's portrait of WikiLeaks founder Julian Assange, which premiered out of competition on Wednesday evening.

assange, controversial figure, movie, (13 more...)

Slate

Country:

North America > United States (1.00)
Europe > Sweden (0.15)
Asia > Middle East > Iraq (0.06)
(4 more...)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Law > Civil Rights & Constitutional Law (1.00)
(2 more...)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > The Future (0.40)
Information Technology > Artificial Intelligence > Science Fiction (0.40)
Information Technology > Artificial Intelligence > History (0.40)

The GuardianMay-21-2025, 04:00:16 GMT

'Every person that clashed with him has left': the rise, fall and spectacular comeback of Sam Altman

The short-lived firing of Sam Altman, the CEO of possibly the world's most important AI company, was sensational. When he was sacked by OpenAI's board members, some of them believed the stakes could not have been higher – the future of humanity – if the organisation continued under Altman. Imagine Succession, with added apocalypse vibes. In early November 2023, after three weeks of secret calls and varying degrees of paranoia, the OpenAI board agreed: Altman had to go. After his removal, Altman's most loyal staff resigned, and others signed an open letter calling for his reinstatement.

altman, hao, openai, (14 more...)

The Guardian

Country:

North America > United States > California > San Francisco County > San Francisco (0.05)
South America > Colombia (0.04)
South America > Chile (0.04)
Africa > Kenya (0.04)

Industry:

Law (1.00)
Information Technology > Security & Privacy (0.47)
Health & Medicine > Consumer Health (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.79)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.64)

Fuck the Algorithm: Conceptual Issues in Algorithmic Bias

Stinson, Catherine

Algorithmic bias has been the subject of much recent controversy. To clarify what is at stake and to make progress resolving the controversy, a better understanding of the concepts involved would be helpful. The discussion here focuses on the disputed claim that algorithms themselves cannot be biased. To clarify this claim we need to know what kind of thing 'algorithms themselves' are, and to disambiguate the several meanings of 'bias' at play. This further involves showing how bias of moral import can result from statistical biases, and drawing connections to previous conceptual work about political artifacts and oppressive things. Data bias has been identified in domains like hiring, policing and medicine. Examples where algorithms themselves have been pinpointed as the locus of bias include recommender systems that influence media consumption, academic search engines that influence citation patterns, and the 2020 UK algorithmically-moderated A-level grades. Recognition that algorithms are a kind of thing that can be biased is key to making decisions about responsibility for harm, and preventing algorithmically mediated discrimination.

artificial intelligence, information management, machine learning, (19 more...)

2505.13509

Country:

North America > Canada > Ontario > Toronto (0.04)
North America > Canada > Ontario > Kingston (0.04)
Oceania > New Zealand > South Island > Canterbury > Christchurch (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry:

Law > Civil Rights & Constitutional Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
Education > Educational Setting (1.00)
(3 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Information Management > Search (0.88)

Germani, Federico, Spitale, Giovanni

Source framing triggers systematic evaluation bias in Large Language Models

Large Language Models (LLMs) are increasingly used not only to generate text but also to evaluate it, raising urgent questions about whether their judgments are consistent, unbiased, and robust to framing effects. In this study, we systematically examine inter - and intra - model agreement across four state - of - the - art LLMs - OpenAI o3 - mini, Deepseek Reasone r, xAI Grok 2, and Mistral - tasked with evaluating 4,800 narrative statements on 24 different topics of social, political, and public health relevance, for a total of 192,000 assessments. W e manipulate the disclosed source of each statement to assess how attribution to either another LLM or a human author of specified nationality affects evaluation outcomes. We find that, in the blind condition, different LLMs display a remarkably high degree of inter - and intra - model agreement across topics . However, this alignment breaks down when source framing is introduced. Here we show that attributing statements to Chinese individuals systematically lowers agreement scores across all models, and in particular for Deepseek Reasoner . Our findings reveal that framing effects can deeply affect text evaluation, with significant implications for the integrity, neutrality, and fairness of LLM - mediated information systems.

deepseekr, large language model, machine learning, (18 more...)

2505.13488

Country:

Asia > China (1.00)
Asia > Russia (0.27)
Asia > Taiwan (0.14)
(12 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry:

Law (1.00)
Government > Regional Government > Asia Government > China Government (1.00)
Health & Medicine > Therapeutic Area (0.70)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents

Fan, Wei, Zheng, Tianshi, Hu, Yiran, Deng, Zheye, Wang, Weiqi, Xu, Baixuan, Li, Chunyang, Li, Haoran, Shen, Weixing, Song, Yangqiu

Legal rules encompass not only codified statutes but also implicit adjudicatory principles derived from precedents that contain discretionary norms, social morality, and policy. While computational legal research has advanced in applying established rules to cases, inducing legal rules from judicial decisions remains understudied, constrained by limitations in model inference efficacy and symbolic reasoning capability. The advent of Large Language Models (LLMs) offers unprecedented opportunities for automating the extraction of such latent principles, yet progress is stymied by the absence of formal task definitions, benchmark datasets, and methodologies. To address this gap, we formalize Legal Rule Induction (LRI) as the task of deriving concise, generalizable doctrinal rules from sets of analogous precedents, distilling their shared preconditions, normative behaviors, and legal consequences. We introduce the first LRI benchmark, comprising 5,121 case sets (38,088 Chinese cases in total) for model tuning and 216 expert-annotated gold test sets. Experimental results reveal that: 1) State-of-the-art LLMs struggle with over-generalization and hallucination; 2) Training on our dataset markedly enhances LLMs capabilities in capturing nuanced rule patterns across similar cases.

large language model, machine learning, natural language, (18 more...)

2505.14104

Country:

Asia (1.00)
North America > United States (0.93)
Europe (0.67)

Genre: Research Report (0.81)

Industry:

Law > Litigation (1.00)
Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.93)
Education > Educational Setting (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Garbacea, Cristina, Tan, Chenhao

HyPerAlign: Interpretable Personalized LLM Alignment via Hypothesis Generation

Alignment algorithms are widely used to align large language models (LLMs) to human users based on preference annotations. Typically these (often divergent) preferences are aggregated over a diverse set of users, resulting in fine-tuned models that are aligned to the ``average-user'' preference. Nevertheless, current models are used by individual users in very specific contexts and situations, emphasizing the need for user-dependent preference control. In this work we address the problem of personalizing LLM outputs to their users. We aim to generate customized responses tailored to specific individuals instead of generic outputs that emulate the collective voices of diverse populations. We propose HyPerAlign, an interpretable and sample-efficient hypothesis-driven personalization approach for LLM models. Given few-shot examples written by a particular user, we first infer hypotheses about their communication strategies, personality, and writing style, then prompt LLM models with these hypotheses and user-specific attributes to generate customized outputs. We conduct experiments on two different personalization tasks, namely authorship attribution and deliberative alignment, with datasets from diverse domains (news articles, blog posts, emails, jailbreaking benchmarks). Results demonstrate the superiority of hypothesis-driven LLM personalization compared to preference-based fine-tuning methods. For authorship attribution, HyPerAlign generations have consistently high win-rates (commonly $> 90\%$) against state-of-the-art preference fine-tuning approaches across diverse user profiles and LLM models. For deliberative alignment, the helpfulness of LLM models is improved by up to $70\%$ on average. Overall, HyPerAlign represents an interpretable and sample-efficient strategy for the personalization of LLM models to individual users.

large language model, machine learning, natural language, (21 more...)

2505.00038

Country: Asia (0.46)

Genre: Research Report > New Finding (0.34)

Industry:

Law > Civil Rights & Constitutional Law (1.00)
Banking & Finance (1.00)
Law Enforcement & Public Safety (0.93)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

Chiu, Yu Ying, Wang, Zhilin, Maiya, Sharan, Choi, Yejin, Fish, Kyle, Levine, Sydney, Hubinger, Evan

Detecting AI risks becomes more challenging as stronger models emerge and find novel methods such as Alignment Faking to circumvent these detection attempts. Inspired by how risky behaviors in humans (i.e., illegal activities that may hurt others) are sometimes guided by strongly-held values, we believe that identifying values within AI models can be an early warning system for AI's risky behaviors. We create LitmusValues, an evaluation pipeline to reveal AI models' priorities on a range of AI value classes. Then, we collect AIRiskDilemmas, a diverse collection of dilemmas that pit values against one another in scenarios relevant to AI safety risks such as Power Seeking. By measuring an AI model's value prioritization using its aggregate choices, we obtain a self-consistent set of predicted value priorities that uncover potential risks. We show that values in LitmusValues (including seemingly innocuous ones like Care) can predict for both seen risky behaviors in AIRiskDilemmas and unseen risky behaviors in HarmBench.

large language model, machine learning, natural language, (20 more...)

2505.14633

Country: North America > United States (0.28)

Genre: Research Report > Promising Solution (0.87)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)