Goto

Collaborating Authors

 Technology


The Gemini-Powered Google Home Speaker Is Finally Here

WIRED

Arriving six years after Google's last smart speaker, the new HomePod-style device was redesigned to play host to Gemini's chatbot. The last time Google released a smart speaker, the world was in the throes of a pandemic . Yes, it's been six years since the company trotted out a dedicated speaker. However, this newest Google Home Speaker brings a big change with it: The device has been redesigned to showcase the new Gemini assistant instead of the Google Assistant that powered all previous speakers and smart displays. Google announced the speaker last fall alongside new Nest smart home cameras and video doorbells, promising a spring 2026 launch.


Information Retrieval Induced Safety Degradation in AIAgents

Neural Information Processing Systems

Despite the growing integration of retrieval-enabled AI agents into society, their safety and ethical behavior remain inadequately understood. In particular, the growing integration of LLMs and AI agents with external information sources and real-world environments raises critical questions about how they engage with and are influenced by these external data sources and interactive contexts. This study investigates how expanding retrieval access--from no external sources to Wikipedia-based retrieval and open web search--affects model reliability, bias propagation, and harmful content generation. Through extensive benchmarking of censored and uncensored LLMs and AIAgents, our findings reveal a consistent degradation in refusal rates, bias sensitivity, and harmfulness safeguards as models gain broader access to external sources, culminating in a phenomenon we term safety degradation. Notably, retrieval-enabled agents built on aligned LLMs often behave more unsafely than uncensored models without retrieval. This effect persists even under strong retrieval accuracy and prompt-based mitigation, suggesting that the mere presence of retrieved content reshapes model behavior in structurally unsafe ways. These findings underscore the need for robust mitigation strategies to ensure fairness and reliability in retrieval-enabled and increasingly autonomous AI systems. Content Warning: This paper contains examples of harmful language.


Language Models Can Predict Their Own Behavior

Neural Information Processing Systems

The text produced by language models (LMs) can exhibit specific'behaviors,' such as a failure to follow alignment training, that we hope to detect and react to during deployment. Identifying these behaviors can often only be done post facto, i.e., after the entire text of the output has been generated. We provide evidence that there are times when we can predict how an LM will behave early in computation, before even a single token is generated. We show that probes trained on the internal representation of input tokens alone can predict a wide range of eventual behaviors over the entire output sequence. Using methods from conformal prediction, we provide provable bounds on the estimation error of our probes, creating precise early warning systems for these behaviors.


This compact Logitech keyboard is down to just 80

PCWorld

When you purchase through links in our articles, we may earn a small commission. If you spend all day typing, the Logitech MX Keys Mini delivers a comfortable typing experience for just $80 right now. You can grab one of the best keyboards for writing for just $80 right now. Logitech's MX Keys Mini is 20 percent off its $99.99 MSRP, a $20 savings on a compact keyboard aimed at people who actually work across several devices. If you spend your day typing, you'll appreciate its quiet keys, sleek design, and minimalist footprint.


ASmooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search

Neural Information Processing Systems

The fundamental limitation of the behavioral cloning (BC) approach to imitation learning is that it only teaches an agent what the expert did at states the expert visited. This means that when a BC agent makes a mistake which takes them out of the support of the demonstrations, they often don't know how to recover from it. In this sense, BC is akin to giving the agent the fish - giving them dense supervision across a narrow set of states - rather than teaching them to fish: to be able to reason independently about achieving the expert's outcome even when faced with unseen situations at test-time. In response, we explore learning to search (L2S) from expert demonstrations, i.e. learning the components required to, at test time, plan to match expert outcomes, even after making a mistake. These include (1) a world model and (2) a reward model. We carefully ablate the set of algorithmic and design decisions required to combine these and other components for stable and sample/interaction-efficient learning of recovery behavior without additional human corrections. Across a dozen visual manipulation tasks from three benchmarks, our approach SAILORconsistently out-performs state-of-the-art Diffusion Policies trained via BC on the same data. Furthermore, scaling up the amount of demonstrations used for BC by 5-10 still leaves a performance gap. We find that SAILORcan identify nuanced failures and is robust to reward hacking.


MyoChallenge 2024: ANew Benchmark for Physiological Dexterity and Agility in Bionic Humans

Neural Information Processing Systems

Recent advancements in bionic prosthetic technology offer transformative opportunities to restore mobility and functionality for individuals with missing limbs. Users of bionic limbs, or bionic humans, learn to seamlessly integrate prosthetic extensions into their motor repertoire, regaining critical motor abilities. The remarkable movement generalization and environmental adaptability demonstrated by these individuals highlight motor intelligence capabilities unmatched by current artificial intelligence systems. Addressing these limitations, MyoChallenge'24 at NeurIPS 2024 established a benchmark for human-robot coordination with an emphasis on joint control of both biological and mechanical limbs. The competition featured two distinct tracks: a manipulation task utilizing the myoMPL model, integrating a virtual biological arm and the Modular Prosthetic Limb (MPL) for a passover task; and a locomotion task using the novel myoOSL model, combining a bilateral virtual biological leg with a trans-femoral amputation and the Open Source Leg (OSL) to navigate varied terrains. Marking the third iteration of the MyoChallenge, the event attracted over 50 teams with more than 290 submissions all around the globe, with diverse participants ranging from independent researchers to high school students. The competition facilitated the development of several state-of-the-art control algorithms for bionic musculoskeletal systems, leveraging techniques such as imitation learning, muscle synergy, and model-based reinforcement learning that significantly surpassed our proposed baseline performance by a factor of 10. By providing the open-source simulation framework of MyoSuite, standardized tasks, and physiologically realistic models, MyoChallenge serves as a reproducible testbed and benchmark for bridging ML and biomechanics.


Vision Foundation Models as Effective Visual for Generation

Neural Information Processing Systems

In this work, we present a novel direction to build an image tokenizer directly on top of a frozen vision foundation model, which is a largely underexplored area. Specifically, we employ a frozen vision foundation model as the encoder of our tokenizer. To enhance its effectiveness, we introduce two key components: (1) a region-adaptive quantization framework that reduces redundancy in the pre-trained features on regular 2D grids, and (2) a semantic reconstruction objective that aligns the tokenizer's outputs with the foundation model's representations to preserve semantic fidelity. Based on these designs, our proposed image tokenizer, VFMTok, achieves substantial improvements in image reconstruction and generation quality, while also enhancing token efficiency. It further boosts autoregressive (AR) generation--achieving a gFID of 1.36 on ImageNet benchmarks, while accelerating model convergence by three times, and enabling high-fidelity class-conditional synthesis without the need for classifier-free guidance (CFG). The code is available at https://github.com/CVMI-Lab/VFMTok.


FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation

Neural Information Processing Systems

Residual connection has been extensively studied and widely applied at the model architecture level. However, its potential in the more challenging data-centric approaches remains unexplored. In this work, we introduce the concept of Data Residual Matching for the first time, leveraging data-level skip connections to facilitate data generation and mitigate data information vanishing. This approach maintains a balance between newly acquired knowledge through pixel space optimization and existing core local information identification within raw data modalities, specifically for the dataset distillation task. Furthermore, by incorporating training-time refinements, our method significantly improves computational efficiency, achieving superior performance while reducing training time and peak GPU memory usage by 50%. Consequently, the proposed method Fast and Accurate Data Residual Matching for Dataset Distillation (FADRM) establishes a new stateof-the-art, demonstrating substantial improvements over existing methods across multiple dataset benchmarks in both efficiency and effectiveness. For instance, with ResNet-18 as the student model and a 0.8% compression ratio on ImageNet-1K, the method achieves 48.4% test accuracy in single-model dataset distillation and 50.9% in multi-model dataset distillation, surpassing RDED by +6.4% and outperforming


EquiTabPFN: ATarget-Permutation Equivariant Prior Fitted Network

Neural Information Processing Systems

Recent foundational models for tabular data, such as TabPFN, excel at adapting to new tasks via in-context learning, but remain constrained to a fixed, pre-defined number of target dimensions--often necessitating costly ensembling strategies. We trace this constraint to a deeper architectural shortcoming: these models lack target equivariance, so that permuting target dimension orderings alters their predictions. This deficiency gives rise to an irreducible "equivariance gap," an error term that introduces instability in predictions. We eliminate this gap by designing a fully target-equivariant architecture--ensuring permutation invariance via equivariant encoders, decoders, and a bi-attention mechanism. Empirical evaluation on standard classification benchmarks shows that, on datasets with more classes than those seen during pre-training, our model matches or surpasses existing methods while incurring lower computational overhead.


Millions told to prepare NOW as Tropical Storm Warning is issued along US coast: 'Arthur is coming'

Daily Mail - Science & tech

'Ringleader' of alleged UFC drone attack to kill Trump is unmasked as illegal migrant who was granted DACA stay under Obama Spy world panic as Tulsi Gabbard prepares to unleash bombshell file dumps on secret CIA'mind control' project and Dr. Fauci Watch horrifying drone video that follows woman's plunge to death after bungee team threw her from bridge without rope Horrific new videos blow Texas woman's mystery death wide open: Her agonizing'final gasp'... unthinkably vile corpse claims... and sick past of man who saw her last Taylor Swift's bottomless thirst for attention, her greed and sheer tackiness are now truly unbearable... this latest stunt has shown her true colors: MAUREEN CALLAHAN Kanye West's wife Bianca Censori raises eyebrows in plunging white lace lingerie as she photographs a nude model at Art Basel in Switzerland Trump says'fools who think I haven't been tough enough on Iran' are'jealous or stupid' after signing widely-criticised deal that includes giving Tehran $300billion Father keeps his cool as shouting man calls cops on him for taking his two young daughters into women's restroom All my friends are suddenly getting divorced. Mid-life wives share taboo sex confessions about why they really leave... including common position that made one hate her husband: JANA HOCKING Sensational REAL reason Jelly Roll is divorcing Bunnie XO: Insiders reveal'preacher's wife' bombshell that's the talk of Nashville... truth about legendary rocker cuckolding rumor... and G-string mishap Brooklyn Beckham is savaged by fans for yet another'classless' swipe at his estranged family as new DoorDash ad is branded a'giant PR mess' LIZ JONES: The cracks in Harry and Meghan's perfect facade have started to show. It's so obvious he's tiring of her tone-deaf approach... and I predict there's serious trouble in store Every emotional moment from the Gilgo Beach killer's sentencing: Rex Heuermann's shocking first words... and the chilling exchange that silenced the room Tropical Storm Arthur has formed in the Gulf, becoming the first named storm of the 2026 Atlantic hurricane season. The National Hurricane Center (NHC) announced Wednesday morning that Arthur had strengthened into a tropical storm with maximum sustained winds of 40mph. The storm was located about 40 miles northeast of Port O'Connor, Texas, and about 190 miles west-southwest of Lake Charles, Louisiana .