Law
Co-Investigator AI: The Rise of Agentic AI for Smarter, Trustworthy AML Compliance Narratives
Naik, Prathamesh Vasudeo, Dintakurthi, Naresh Kumar, Hu, Zhanghao, Wang, Yue, Qiu, Robby
Generating regulatorily compliant Suspicious Activity Report (SAR) remains a high-cost, low-scalability bottleneck in Anti-Money Laundering (AML) workflows. While large language models (LLMs) offer promising fluency, they suffer from factual hallucination, limited crime typology alignment, and poor explainability -- posing unacceptable risks in compliance-critical domains. This paper introduces Co-Investigator AI, an agentic framework optimized to produce Suspicious Activity Reports (SARs) significantly faster and with greater accuracy than traditional methods. Drawing inspiration from recent advances in autonomous agent architectures, such as the AI Co-Scientist, our approach integrates specialized agents for planning, crime type detection, external intelligence gathering, and compliance validation. The system features dynamic memory management, an AI-Privacy Guard layer for sensitive data handling, and a real-time validation agent employing the Agent-as-a-Judge paradigm to ensure continuous narrative quality assurance. Human investigators remain firmly in the loop, empowered to review and refine drafts in a collaborative workflow that blends AI efficiency with domain expertise. We demonstrate the versatility of Co-Investigator AI across a range of complex financial crime scenarios, highlighting its ability to streamline SAR drafting, align narratives with regulatory expectations, and enable compliance teams to focus on higher-order analytical work. This approach marks the beginning of a new era in compliance reporting -- bringing the transformative benefits of AI agents to the core of regulatory processes and paving the way for scalable, reliable, and transparent SAR generation.
The DOGE Subcommittee Hearing on Weather Modification Was a Nest of Conspiracy Theorizing
A House Oversight Committee hearing produced a flood of bizarre claims about cloud seeding, chemtrails, and solar geoengineering. Proven, human-driven changes to the weather were dismissed. "What this whole debate comes down to is who controls the skies," Republican representative Marjorie Taylor Greene of Georgia told the audience at a House Oversight Committee hearing on Tuesday. "Do we believe in God and that he has dominion over his perfect creation of planet Earth? Do we believe that he has given us everything we need to survive as a civilization since the beginning of time? Or do you believe in man's claim of authority over the weather, based on scientists that have only been alive for decades and weren't here to witness the climate changes since the beginning of time?"
Steven Pinker's new book shows how he's become a contradictory figure
Steven Pinker's new book shows how he's become a contradictory figure Steven Pinker's new book When Everyone Knows That Everyone Knows makes a compelling case for common knowledge. Steven Pinker argues that "cancel culture" is a form of censorship Steven Pinker's new book perfectly encapsulates what a contradictory figure he has become. Much of it is a clear, fascinating explanation of a major psychological phenomenon . But then he starts telling you what he thinks about current affairs. Pinker is a psychologist at Harvard University who has written a string of popular science books. Some, like Words and Rules, are rooted in his own research and are a good read.
ChatGPT developing age-verification system to identify under-18 users after teen death
OpenAI will restrict how ChatGPT responds to a user it suspects is under 18. OpenAI will restrict how ChatGPT responds to a user it suspects is under 18. Sam Altman said if there is doubt the system will default to the under-18 experience putting'safety ahead of privacy and freedom for teens' OpenAI will restrict how ChatGPT responds to a user it suspects is under 18, unless that user passes the company's age estimation technology or provides ID, after legal action from the family of a 16-year-old who killed himself in April after months of conversations with the chatbot. OpenAI was prioritising "safety ahead of privacy and freedom for teens", chief executive Sam Altman said in a blog post on Tuesday, stating "minors need significant protection". The company said that the way ChatGPT responds to a 15-year-old should look different to the way it responds to an adult.
Introducing the A2AJ's Canadian Legal Data: An open-source alternative to CanLII for the era of computational law
The Access to Algorithmic Justice project (A2AJ) is an open-source alternative to the Canadian Legal Information Institute (CanLII). At a moment when technology promises to enable new ways of working with law, CanLII is becoming an impediment to the free access of law and access to justice movements because it restricts bulk and programmatic access to Canadian legal data. This means that Canada is staring down a digital divide: well-resourced actors have the best new technological tools and, because CanLII has disclaimed leadership, the public only gets second-rate tools. This article puts CanLII in its larger historical context and shows how long and deep efforts to democratize access to Canadian legal data are, and how often they are thwarted by private industry. We introduce the A2AJ's Canadian Legal Data project, which provides open access to over 116,000 court decisions and 5,000 statutes through multiple channels including APIs, machine learning datasets, and AI integration protocols. Through concrete examples, we demonstrate how open legal data enables courts to conduct evidence-based assessments and allows developers to create tools for practitioners serving low-income communities.
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
Li, Kuan, Zhang, Zhongwang, Yin, Huifeng, Ye, Rui, Zhao, Yida, Zhang, Liwen, Ou, Litu, Zhang, Dingchu, Wu, Xixi, Wu, Jialong, Wang, Xinyu, Qiao, Zile, Zhang, Zhen, Jiang, Yong, Xie, Pengjun, Huang, Fei, Zhou, Jingren
Transcending human cognitive limitations represents a critical frontier in LLM training. Proprietary agentic systems like DeepResearch have demonstrated superhuman capabilities on extremely complex information-seeking benchmarks such as BrowseComp, a feat previously unattainable. We posit that their success hinges on a sophisticated reasoning pattern absent in open-source models: the ability to systematically reduce extreme uncertainty when navigating vast information landscapes. Based on this insight, we introduce WebSailor, a complete post-training methodology designed to instill this crucial capability. Our approach involves generating novel, high-uncertainty tasks through structured sampling and information obfuscation, RFT cold start, and an efficient agentic RL training algorithm, Duplicating Sampling Policy Optimization (DUPO). With this integrated pipeline, WebSailor significantly outperforms all open-source agents in complex information-seeking tasks, matching proprietary agents' performance and closing the capability gap.
Podcasts as a Medium for Participation in Collective Action: A Case Study of Black Lives Matter
Moldovan, Theodora, Pera, Arianna, Vega, Davide, Aiello, Luca Maria
We study how participation in collective action is articulated in podcast discussions, using the Black Lives Matter (BLM) movement as a case study. While research on collective action discourse has primarily focused on text-based content, this study takes a first step toward analyzing audio formats by using podcast transcripts. Using the Structured Podcast Research Corpus (SPoRC), we investigated spoken language expressions of participation in collective action, categorized as problem-solution, call-to-action, intention, and execution. We identified podcast episodes discussing racial justice after important BLM-related events in May and June of 2020, and extracted participatory statements using a layered framework adapted from prior work on social media. We examined the emotional dimensions of these statements, detecting eight key emotions and their association with varying stages of activism. We found that emotional profiles vary by stage, with different positive emotions standing out during calls-to-action, intention, and execution. We detected negative associations between collective action and negative emotions, contrary to theoretical expectations. Our work contributes to a better understanding of how activism is expressed in spoken digital discourse and how emotional framing may depend on the format of the discussion.
Agentic AI for Financial Crime Compliance
Axelsen, Henrik, Licht, Valdemar, Damsgaard, Jan
The cost and complexity of financial crime compliance (FCC) continue to rise, often without measurable improvements in effectiveness. While AI offers potential, most solutions remain opaque and poorly aligned with regulatory expectations. This paper presents the design and deployment of an agentic AI system for FCC in digitally native financial platforms. Developed through an Action Design Research (ADR) process with a fintech firm and regulatory stakeholders, the system automates onboarding, monitoring, investigation, and reporting, emphasizing explainability, traceability, and compliance-by-design. Using artifact-centric modeling, it assigns clearly bounded roles to autonomous agents and enables task-specific model routing and audit logging. The contribution includes a reference architecture, a real-world prototype, and insights into how Agentic AI can reconfigure FCC workflows under regulatory constraints. Our findings extend IS literature on AI-enabled compliance by demonstrating how automation, when embedded within accountable governance structures, can support transparency and institutional trust in high-stakes, regulated environments.
Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety
Janiak, Denis, Moska, Julia, Motyka, Dawid, Seweryn, Karolina, Walkowiak, Paweł, Żuk, Bartosz, Janz, Arkadiusz
Large language models (LLMs) require careful alignment to balance competing objectives - factuality, safety, conciseness, proactivity, and diversity. Existing studies focus on individual techniques or specific dimensions, lacking a holistic assessment of the inherent trade-offs. We propose a unified evaluation framework that compares LLM alignment methods (PPO, DPO, ORPO, KTO) across these five axes, using both in-distribution and out-of-distribution datasets. Leveraging a specialized LLM-as-Judge prompt, validated through human studies, we reveal that DPO and KTO excel in factual accuracy, PPO and DPO lead in safety, and PPO best balances conciseness with proactivity. Our findings provide insights into trade-offs of common alignment methods, guiding the development of more balanced and reliable LLMs.