Personal
Former Scale AI CEO Alexandr Wang on AI's Potential and Its 'Deficiencies'
On June 12, Alexandr Wang stepped down as Scale's CEO to chase his most ambitious moonshot yet: building smarter-than-human AI as head of Meta's new "superintelligence" division. As part of his move, Meta will invest 14.3 billion for a minority stake in Scale AI, but the real prize isn't his company--it's Wang himself. Wang, 28, is expected to bring a sense of urgency to Meta's AI efforts, which this year have been plagued by delays and underwhelming performance. Once the undisputed leader of open-weight AI, the U.S. tech giant has been overtaken by Chinese rivals like DeepSeek on popular benchmarks. Although Wang, who dropped out of MIT at 19, lacks the academic chops of some of his peers, he offers both insight into the types of data Meta's rivals use to improve their AI systems, and unrivaled ambition.
This May Be Trump's Most Consequential Decision Yet
This week, Emily Bazelon, John Dickerson, and David Plotz discuss whether the US should join Israel's war on Iran, the tragic Minnesota assassinations and why US political violence is surging now, and the Supreme Court's unsurprising but willfully obtuse decision to uphold Tennessee's youth transgender care ban. Here are some notes and references from this week's show: Alexander Ward, Lara Seligman, and Dustin Volz for The Wall Street Journal (Exclusive): Israel Built Its Case for War With Iran on New Intelligence. The U.S. Didn't Buy It. Thomas L. Friedman for The New York Times (Opinion): The Smart Way for Trump to End the Israel-Iran War Oren Cass for Understanding America (Substack): Is Israel the Ideal "America First" Ally? Warren P. Strobel, Alex Horton, and Abigail Hauslohner for the Washington Post: Navigating Iran crisis, Trump relies on experience over star power Amy Howe for SCOTUSblog: Court upholds Tennessee's ban on certain medical treatments for transgender minors Abbie VanSickle for The New York Times: Sotomayor Writes the Court'Abandons' Transgender Children to'Political Whims' Ella Lee for The Hill: Clarence Thomas urges courts to end deferring to'experts' on gender-affirming care Ian Millhiser for Vox: The Supreme Court's incoherent new attack on trans rights, explained Here are this week's chatters: Emily: A Family Matter by Claire Lynch; The Fall of Affirmative Action: Race, the Supreme Court, and the Future of Higher Education by Justin Driver; A Flower Traveled in My Blood: The Incredible True Story of the Grandmothers Who Fought to Find a Stolen Generation of Children by Haley Cohen Gilliland. John: Mary Cunningham for CBS News: Federal Reserve holds its benchmark interest rate steady at today's FOMC meeting; ABA Banking Journal: Fed's Powell says some areas of U.S. may be'uninsurable' in next decade David: Trip Gabriel for the New York Times: William Langewiesche, the'Steve McQueen of Journalism,' Dies at 70 For this week's Slate Plus bonus episode, Emily, John, and David discuss the exciting possibilities and likely limitations of using AI tools for historical research and writing.
The Hardness of Achieving Impact in AI for Social Impact Research: A Ground-Level View of Challenges & Opportunities
Majumdar, Aditya, Zhang, Wenbo, Prawal, Kashvi, Yadav, Amulya
In an attempt to tackle the UN SDGs, AI for Social Impact (AI4SI) projects focus on harnessing AI to address societal issues in areas such as healthcare, social justice, etc. Unfortunately, despite growing interest in AI4SI, achieving tangible, on-the-ground impact remains a significant challenge. For example, identifying and engaging motivated collaborators who are willing to co-design and deploy AI based solutions in real-world settings is often difficult. Even when such partnerships are established, many AI4SI projects "fail" to progress beyond the proof-of-concept stage, and hence, are unable to transition to at-scale production-level solutions. Furthermore, the unique challenges faced by AI4SI researchers are not always fully recognized within the broader AI community, where such work is sometimes viewed as primarily applied and not aligning with the traditional criteria for novelty emphasized in core AI venues. This paper attempts to shine a light on the diverse challenges faced in AI4SI research by diagnosing a multitude of factors that prevent AI4SI partnerships from achieving real-world impact on the ground. Drawing on semi-structured interviews with six leading AI4SI researchers - complemented by the authors' own lived experiences in conducting AI4SI research - this paper attempts to understand the day-to-day difficulties faced in developing and deploying socially impactful AI solutions. Through thematic analysis, we identify structural and organizational, communication, collaboration, and operational challenges as key barriers to deployment. While there are no easy fixes, we synthesize best practices and actionable strategies drawn from these interviews and our own work in this space. In doing so, we hope this paper serves as a practical reference guide for AI4SI researchers and partner organizations seeking to engage more effectively in socially impactful AI collaborations.
Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers
Shi, Jiayue Melissa, Yoo, Dong Whi, Wang, Keran, Rodriguez, Violeta J., Karkar, Ravi, Saha, Koustuv
Family caregivers of individuals with Alzheimer's Disease and Related Dementia (AD/ADRD) face significant emotional and logistical challenges that place them at heightened risk for stress, anxiety, and depression. Although recent advances in generative AI -- particularly large language models (LLMs) -- offer new opportunities to support mental health, little is known about how caregivers perceive and engage with such technologies. To address this gap, we developed Carey, a GPT-4o-based chatbot designed to provide informational and emotional support to AD/ADRD caregivers. Using Carey as a technology probe, we conducted semi-structured interviews with 16 family caregivers following scenario-driven interactions grounded in common caregiving stressors. Through inductive coding and reflexive thematic analysis, we surface a systemic understanding of caregiver needs and expectations across six themes -- on-demand information access, emotional support, safe space for disclosure, crisis management, personalization, and data privacy. For each of these themes, we also identified the nuanced tensions in the caregivers' desires and concerns. We present a mapping of caregiver needs, AI chatbot's strengths, gaps, and design recommendations. Our findings offer theoretical and practical insights to inform the design of proactive, trustworthy, and caregiver-centered AI systems that better support the evolving mental health needs of AD/ADRD caregivers.
Amazon Rebuilt Alexa Using a 'Staggering' Amount of AI Tools
Daniel Rausch, Amazon's vice president of Alexa and Echo, is in the midst of a major transition. More than a decade beyond the launch of Amazon's Alexa, he's been tasked with creating a new version of the marquee voice assistant, one that's powered by large language models. As he put it in my interview with him, this new assistant, dubbed Alexa, is "a complete rebuild of the architecture." How did his team approach Amazon's largest ever revamp of its voice assistant? They used AI to build AI, of course.
Gearing up for RoboCupJunior: Interview with Ana Patrรญcia Magalhรฃes
The annual RoboCup event, where teams gather from across the globe to take part in competitions across a number of leagues, will this year take place in Brazil, from 15-21 July. An important part of the week is RoboCupJunior, which is designed to introduce RoboCup to school children, and sees hundreds of kids taking part in a variety of challenges across different leagues. This year, the lead organizer for RoboCupJunior is Ana Patrรญcia Magalhรฃes. We caught up with her to find out how the preparations are going, what to expect at this year's competition, and how RoboCup inspires communities. RoboCup will take place from 15-21 July, in Salvador, Brazil.
A Systematic Review of User-Centred Evaluation of Explainable AI in Healthcare
Donoso-Guzmรกn, Ivania, Kacafรญrkovรก, Kristรฝna Sirka, Szymanski, Maxwell, Jacobs, An, Parra, Denis, Verbert, Katrien
Despite promising developments in Explainable Artificial Intelligence, the practical value of XAI methods remains under-explored and insufficiently validated in real-world settings. Robust and context-aware evaluation is essential, not only to produce understandable explanations but also to ensure their trustworthiness and usability for intended users, but tends to be overlooked because of no clear guidelines on how to design an evaluation with users. This study addresses this gap with two main goals: (1) to develop a framework of well-defined, atomic properties that characterise the user experience of XAI in healthcare; and (2) to provide clear, context-sensitive guidelines for defining evaluation strategies based on system characteristics. We conducted a systematic review of 82 user studies, sourced from five databases, all situated within healthcare settings and focused on evaluating AI-generated explanations. The analysis was guided by a predefined coding scheme informed by an existing evaluation framework, complemented by inductive codes developed iteratively. The review yields three key contributions: (1) a synthesis of current evaluation practices, highlighting a growing focus on human-centred approaches in healthcare XAI; (2) insights into the interrelations among explanation properties; and (3) an updated framework and a set of actionable guidelines to support interdisciplinary teams in designing and implementing effective evaluation strategies for XAI systems tailored to specific application contexts.
Controlling Context: Generative AI at Work in Integrated Circuit Design and Other High-Precision Domains
Moss, Emanuel, Watkins, Elizabeth, Persaud, Christopher, Karunaratne, Passant, Nafus, Dawn
Generative AI tools have become more prevalent in engineering workflows, particularly through chatbots and code assistants. As the perceived accuracy of these tools improves, questions arise about whether and how those who work in high-precision domains might maintain vigilance for errors, and what other aspects of using such tools might trouble their work. This paper analyzes interviews with hardware and software engineers, and their collaborators, who work in integrated circuit design to identify the role accuracy plays in their use of generative AI tools and what other forms of trouble they face in using such tools. The paper inventories these forms of trouble, which are then mapped to elements of generative AI systems, to conclude that controlling the context of interactions between engineers and the generative AI tools is one of the largest challenges they face. The paper concludes with recommendations for mitigating this form of trouble by increasing the ability to control context interactively.
American citizen killed in Russian attack on Kyiv, State Department confirms
A U.S. citizen died during a Russian missile attack on the Ukrainian capital of Kyiv, the State Department confirmed Tuesday afternoon. An American citizen was among the 15 killed in Russian drone and missile strikes on the Ukrainian capital city, Kyiv, on Tuesday, State Department spokesperson Tammy Bruce confirmed in a press conference Wednesday. In response to a reporter's question on U.S. diplomats in Kyiv having to spend the night in a bunker, Bruce said "we can confirm the death of a U.S. citizen in Ukraine." "We are aware of last night's attack on Kyiv that resulted in numerous casualties, including the tragic death of a U.S. citizen," she said, noting, "We condemn those strikes and extend our deepest condolences to the victims and to the families of all those affected." Bruce did not offer any more details on the identity of the citizen killed by the Russian strikes, citing "respect to the family during this obviously horrible time."
Exploring the Potential of Metacognitive Support Agents for Human-AI Co-Creation
Gmeiner, Frederic, Luo, Kaitao, Wang, Ye, Holstein, Kenneth, Martelaro, Nikolas
Despite the potential of generative AI (GenAI) design tools to enhance design processes, professionals often struggle to integrate AI into their workflows. Fundamental cognitive challenges include the need to specify all design criteria as distinct parameters upfront (intent formulation) and designers' reduced cognitive involvement in the design process due to cognitive offloading, which can lead to insufficient problem exploration, underspecification, and limited ability to evaluate outcomes. Motivated by these challenges, we envision novel metacognitive support agents that assist designers in working more reflectively with GenAI. To explore this vision, we conducted exploratory prototyping through a Wizard of Oz elicitation study with 20 mechanical designers probing multiple metacognitive support strategies. We found that agent-supported users created more feasible designs than non-supported users, with differing impacts between support strategies. Based on these findings, we discuss opportunities and tradeoffs of metacognitive support agents and considerations for future AI-based design tools.