Instructional Material
'Uncanny Valley': Donald Trump's Davos Drama, AI Midterms, and ChatGPT's Last Resort
On this episode of, our hosts unpack the news from Davos, where Trump and major AI companies shared the stage at the World Economic Forum. This week, WIRED's Brian Barrett and Leah Feiger are joining the show as the new cohosts, alongside Zoë Schiffer. And our attention has been drawn to the drama going down in the quaint little town of Davos. Zoë tells us how at the World Economic Forum's event, major AI players like Anthropic have been the protagonists--sharing the spotlight with President Donald Trump, who insists on invading Greenland. Brian has been looking at how ICE activity is developing, and Leah is forcing us to think about this year's midterms because tech giants are already pouring millions into it. Plus, we dive into why OpenAI's decision to roll out ads in ChatGPT was a long time coming. Ads Are Coming to ChatGPT. Write to us at uncannyvalley@wired.com . You can always listen to this week's podcast through the audio player on this page, but if you want to subscribe for free to get every episode, here's how: If you're on an iPhone or iPad, open the app called Podcasts, or just tap this link . Today, we're starting a bit of a new chapter here on the show, and I want to introduce you to my brand new cohost, Brian Barrett, our executive editor here at WIRED, and Leah Feiger, our senior politics editor. So thrilled to be here. So longtime listeners know the show has taken on a bunch of different formats since it launched. We had the Gadget Lab days, the roundtable, news episodes. We really created this podcast because we want to bring you the best stories and the best takes about what's happening in tech and politics. That's all going to stay the same, but this time we're going to go even deeper. What trends you should be watching for, the news that's already happened or about to break, and how we are thinking about all of it.
Statistical Reinforcement Learning in the Real World: A Survey of Challenges and Future Directions
Gazi, Asim H., Guo, Yongyi, Gao, Daiqi, Xu, Ziping, Zhang, Kelly W., Murphy, Susan A.
Reinforcement learning (RL) has achieved remarkable success in real-world decision-making across diverse domains, including gaming, robotics, online advertising, public health, and natural language processing. Despite these advances, a substantial gap remains between RL research and its deployment in many practical settings. Two recurring challenges often underlie this gap. First, many settings offer limited opportunity for the agent to interact extensively with the target environment due to practical constraints. Second, many target environments often undergo substantial changes, requiring redesign and redeployment of RL systems (e.g., advancements in science and technology that change the landscape of healthcare delivery). Addressing these challenges and bridging the gap between basic research and application requires theory and methodology that directly inform the design, implementation, and continual improvement of RL systems in real-world settings. In this paper, we frame the application of RL in practice as a three-component process: (i) online learning and optimization during deployment, (ii) post- or between-deployment offline analyses, and (iii) repeated cycles of deployment and redeployment to continually improve the RL system. We provide a narrative review of recent advances in statistical RL that address these components, including methods for maximizing data utility for between-deployment inference, enhancing sample efficiency for online learning within-deployment, and designing sequences of deployments for continual improvement. We also outline future research directions in statistical RL that are use-inspired -- aiming for impactful application of RL in practice.
Lego's latest educational kit seeks to teach AI as part of computer science, not to build a chatbot
Lego also recognized that it had to build a course that'll work regardless of a teacher's fluency in such subjects. So a big part of developing the course was making sure that teachers had the tools they needed to be on top of whatever lessons they're working on. "When we design and we test the products, we're not the ones testing in the classroom," Silwinski said. "We give it to a teacher and we provide all of the lesson materials, all of the training, all of the notes, all the presentation materials, everything that they need to be able to teach the lesson." Lego also took into account the fact that some schools might introduce its students to these things starting in Kindergarten, whereas others might skip to the grade 3-5 or 6-8 sets.
Dozens of ICE Vehicles in Minnesota Lack 'Necessary' Lights and Sirens
A contract justification published in a federal register on Tuesday says that 31 ICE vehicles operating in the Twin Cities area "lack the necessary emergency lights and sirens" to be "compliant." Paul area "currently lack the necessary emergency lights and sirens" required to be "compliant with law enforcement requirements," according to a contract justification published in a federal register on Tuesday. The document justifies ICE paying Whelen Engineering Company, a Connecticut-based firm specializing in "emergency warning and lighting technology," $47,330.49 The document explains that the ATLAS kits would "allow vehicles to be immediately operational and compliant with law enforcement requirements to support the current surge operation" out of Homeland Security Investigations (HSI)'s St. Paul office, which conducts operations in Minnesota, North Dakota, and South Dakota. "These vehicles were deployed prior to being permanently retrofitted and currently lack the necessary emergency lights and sirens required for operational use," the document says.
Start your 2026 Resolutions with a lifetime membership to Rosetta Stone's language learning program for just 149
Grab a lifetime membership and learn up to 25 languages instead of scrolling your life away in 2026. We may earn revenue from the products available on this page and participate in affiliate programs. Learning a new language is a very common resolution. It's useful, stimulating, and can even be fun if you choose the right method. Right now, Rosetta Stone has discounted its lifetime memberships, which means you can pay once and learn forever.
Robohub highlights 2025
Over the course of the year, we've had the pleasure of working with many talented researchers from across the globe. As 2025 draws to a close, we take a look back at some of the excellent blog posts, interviews and podcasts from our contributors. Jiahui Zhang and Jesse Zhang to tell us about their framework for learning robot manipulation tasks solely from language instructions without per-task demonstrations. Hui Zhang writes about work presented at CoRL2025 on RobustDexGrasp, a novel framework that tackles different grasping challenges with targeted solutions. In this podcast from AAAI, host Ella Lan asked Professor Marynel Vázquez about what inspired her research direction, how her perspective on human-robot interactions has changed over time, robots navigating the social world, and more.
3 New Tricks to Try With Google Gemini Live After Its Latest Major Upgrade
Google's AI is now even smarter, and more versatile. Gemini Live is the more conversational, natural language way of interacting with the Google Gemini AI bot using your voice. The idea is you chat with it like you would chat with a friend, interruptions and all, even if the actual answers are the same as you'd get from typing your queries into Gemini as normal. Now, about a year and a half after its debut, Gemini Live has been given what Google is describing as its "biggest update ever." The update makes the Gemini Live mode even more natural and even more conversational than before, with a better understanding of tone, nuance, pronunciation, and rhythm.
Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Procedural activity understanding requires perceiving human actions in terms of a broader task, where multiple keysteps are performed in sequence across a long video to reach a final goal state---such as the steps of a recipe or the steps of a DIY fix-it task. Prior work largely treats keystep recognition in isolation of this broader structure, or else rigidly confines keysteps to align with a particular sequential script. We propose discovering a task graph automatically from how-to videos to represent probabilistically how people tend to execute keysteps, then leverage this graph to regularize keystep recognition in novel videos. On multiple datasets of real-world instructional video, we show the impact: more reliable zero-shot keystep localization and improved video representation learning, exceeding the state of the art.
Causal Imitation for Markov Decision Processes: a Partial Identification Approach
Imitation learning enables an agent to learn from expert demonstrations when the performance measure is unknown and the reward signal is not specified. Standard imitation methods do not generally apply when the learner and the expert's sensory capabilities mismatch and demonstrations are contaminated with unobserved confounding bias. To address these challenges, recent advancements in causal imitation learning have been pursued. However, these methods often require access to underlying causal structures that might not always be available, posing practical challenges.In this paper, we investigate robust imitation learning within the framework of canonical Markov Decision Processes (MDPs) using partial identification, allowing the agent to achieve expert performance even when the system dynamics are not uniquely determined from the confounded expert demonstrations. Specifically, first, we theoretically demonstrate that when unobserved confounders (UCs) exist in an MDP, the learner is generally unable to imitate expert performance. We then explore imitation learning in partially identifiable settings --- either transition distribution or reward function is non-identifiable from the available data and knowledge. Augmenting the celebrated GAIL method (Ho \& Ermon, 2016), our analysis leads to two novel causal imitation algorithms that can obtain effective policies guaranteed to achieve expert performance.