uplift
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- (2 more...)
Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models III: Implementing the Bacterial Biothreat Benchmark (B3) Dataset
Ackerman, Gary, Wilson, Theodore, Kallenborn, Zachary, Shoemaker, Olivia, Wetzel, Anna, Peterson, Hayley, Danfora, Abigail, LaTourette, Jenna, Behlendorf, Brandon, Clifford, Douglas
The potential for rapidly-evolving frontier artificial intelligence (AI) models, especially large language models (LLMs), to facilitate bioterrorism or access to biological weapons has generated significant policy, academic, and public concern. Both model developers and policymakers seek to quantify and mitigate any risk, with an important element of such efforts being the development of model benchmarks that can assess the biosecurity risk posed by a particular model. This paper discusses the pilot implementation of the Bacterial Biothreat Benchmark (B3) dataset. It is the third in a series of three papers describing an overall Biothreat Benchmark Generation (BBG) framework, with previous papers detailing the development of the B3 dataset. The pilot involved running the benchmarks through a sample frontier AI model, followed by human evaluation of model responses, and an applied risk analysis of the results along several dimensions. Overall, the pilot demonstrated that the B3 dataset offers a viable, nuanced method for rapidly assessing the biosecurity risk posed by a LLM, identifying the key sources of that risk and providing guidance for priority areas of mitigation priority.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- (2 more...)
On the Mechanisms of Collaborative Learning in VAE Recommenders
Vuong, Tung-Long, Monteil, Julien, Dang, Hien, Vaskovych, Volodymyr, Le, Trung, Nguyen, Vu
Variational Autoencoders (VAEs) are a powerful alternative to matrix factorization for recommendation. A common technique in VAE-based collaborative filtering (CF) consists in applying binary input masking to user interaction vectors, which improves performance but remains underexplored theoretically. In this work, we analyze how collaboration arises in VAE-based CF and show it is governed by latent proximity: we derive a latent sharing radius that informs when an SGD update on one user strictly reduces the loss on another user, with influence decaying as the latent Wasserstein distance increases. We further study the induced geometry: with clean inputs, VAE-based CF primarily exploits \emph{local} collaboration between input-similar users and under-utilizes global collaboration between far-but-related users. We compare two mechanisms that encourage \emph{global} mixing and characterize their trade-offs: (1) $β$-KL regularization directly tightens the information bottleneck, promoting posterior overlap but risking representational collapse if too large; (2) input masking induces stochastic geometric contractions and expansions, which can bring distant users onto the same latent neighborhood but also introduce neighborhood drift. To preserve user identity while enabling global consistency, we propose an anchor regularizer that aligns user posteriors with item embeddings, stabilizing users under masking and facilitating signal sharing across related items. Our analyses are validated on the Netflix, MovieLens-20M, and Million Song datasets. We also successfully deployed our proposed algorithm on an Amazon streaming platform following a successful online experiment.
- Information Technology (0.48)
- Media > Film (0.34)
- Leisure & Entertainment (0.34)
Feature-driven reinforcement learning for photovoltaic in continuous intraday trading
Abate, Arega Getaneh, Liu, Xiufeng, Liu, Ruyu, Zhang, Xiaobing
Photovoltaic (PV) operators face substantial uncertainty in generation and short-term electricity prices. Continuous intraday markets enable producers to adjust their positions in real time, potentially improving revenues and reducing imbalance costs. We propose a feature-driven reinforcement learning (RL) approach for PV intraday trading that integrates data-driven features into the state and learns bidding policies in a sequential decision framework. The problem is cast as a Markov Decision Process with a reward that balances trading profit and imbalance penalties and is solved with Proximal Policy Optimization (PPO) using a predominantly linear, interpretable policy. Trained on historical market data and evaluated out-of-sample, the strategy consistently outperforms benchmark baselines across diverse scenarios. Extensive validation shows rapid convergence, real-time inference, and transparent decision rules. Learned weights highlight the central role of market microstructure and historical features. Taken together, these results indicate that feature-driven RL offers a practical, data-efficient, and operationally deployable pathway for active intraday participation by PV producers.
- Europe > Middle East > Cyprus > Limassol > Limassol (0.05)
- Europe > Germany (0.04)
- North America > Costa Rica > Heredia Province > Heredia (0.04)
- Europe > Denmark > Capital Region > Kongens Lyngby (0.04)
- Energy > Renewable > Solar (1.00)
- Energy > Power Industry (1.00)
- Banking & Finance > Trading (1.00)
Redefining CX with Agentic AI: Minerva CQ Case Study
Agrawal, Garima, De Maria, Riccardo, Davuluri, Kiran, Spera, Daniele, Read, Charlie, Spera, Cosimo, Garrett, Jack, Miller, Don
Despite advances in AI for contact centers, customer experience (CX) continues to suffer from high average handling time (AHT), low first-call resolution, and poor customer satisfaction (CSAT). A key driver is the cognitive load on agents, who must navigate fragmented systems, troubleshoot manually, and frequently place customers on hold. Existing AI-powered agent-assist tools are often reactive driven by static rules, simple prompting, or retrieval-augmented generation (RAG) without deeper contextual reasoning. We introduce Agentic AI goal-driven, autonomous, tool-using systems that proactively support agents in real time. Unlike conventional approaches, Agentic AI identifies customer intent, triggers modular workflows, maintains evolving context, and adapts dynamically to conversation state. This paper presents a case study of Minerva CQ, a real-time Agent Assist product deployed in voice-based customer support. Minerva CQ integrates real-time transcription, intent and sentiment detection, entity recognition, contextual retrieval, dynamic customer profiling, and partial conversational summaries enabling proactive workflows and continuous context-building. Deployed in live production, Minerva CQ acts as an AI co-pilot, delivering measurable improvements in agent efficiency and customer experience across multiple deployments.
- Asia > India (0.04)
- North America > United States > California (0.04)
- Asia > Middle East > Jordan (0.04)
- Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Segment Discovery: Enhancing E-commerce Targeting
Li, Qiqi, Singh, Roopali, Polpanumas, Charin, Fiez, Tanner, Kumar, Namita, Chakrabarti, Shreya
Popular promotions include discounts, bundled offers, free services, etc. By offering these promotions, companies aim to increase revenue and customer base, while also improving customer experience. However, such promotions usually incur a cost and can become unsustainable without any guardrails in place. A popular approach is to target customers with high or low propensity for desired behavior. For example, a retail company is likely to target customers who are at risk of leaving if they want to retain its customers by offering certain incentives. However, previous studies show that this strategy is ineffective and could be detrimental towards the company objectives [2, 6, 7]. Moreover, additional analysis needs to be done for the choice of propensity score threshold for targeting (e.g., target anyone whose propensity to leave is higher than 0.8), because the wrong threshold may lead to sub-optimal outcomes [2]. Each customer responds differently to the same promotion.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
- Asia > Japan (0.04)
Assessing the Impact of Upselling in Online Fantasy Sports
This study explores the impact of upselling on user engagement. We model users' deposit behaviour on the fantasy sports platform Dream11. Subsequently, we develop an experimental framework to evaluate the effect of upselling using an intensity parameter. Our live experiments on user deposit behaviour reveal decreased user recall with heightened upselling intensity. Our findings indicate that increased upselling intensity improves user deposit metrics and concurrently diminishes user satisfaction and conversion rates. We conduct robust counterfactual analysis and train causal meta-learners to personalise users' upselling intensity levels to reach an optimal trade-off point.
- North America > United States > Louisiana > East Baton Rouge Parish > Baton Rouge (0.06)
- North America > United States > New York > New York County > New York City (0.04)
- North America > Montserrat (0.04)
- (2 more...)
- Leisure & Entertainment > Sports (0.62)
- Leisure & Entertainment > Gambling (0.62)
This Political Startup Wants to Help Progressives Win … With AI-Generated Ads
Stories about AI-generated political content are like stories about people drunkenly setting off fireworks: There's a good chance they'll end in disaster. WIRED is tracking AI usage in political campaigns across the world, and so far examples include pornographic deepfakes and misinformation-spewing chatbots. It's gotten to the point where the US Federal Communications Commission has proposed mandatory disclosures for AI use in television and radio ads. Despite concerns, some US political campaigns are embracing generative AI tools. There's a growing category of AI-generated political content flying under the radar this election cycle, developed by startups including Denver-based BattlegroundAI, which uses generative AI to come up with digital advertising copy at a rapid clip.
- Government > Voting & Elections (0.53)
- Media > News (0.37)
- Government > Regional Government > North America Government > United States Government (0.37)