Technology
ASmooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search
The fundamental limitation of the behavioral cloning (BC) approach to imitation learning is that it only teaches an agent what the expert did at states the expert visited. This means that when a BC agent makes a mistake which takes them out of the support of the demonstrations, they often don't know how to recover from it. In this sense, BC is akin to giving the agent the fish - giving them dense supervision across a narrow set of states - rather than teaching them to fish: to be able to reason independently about achieving the expert's outcome even when faced with unseen situations at test-time. In response, we explore learning to search (L2S) from expert demonstrations, i.e. learning the components required to, at test time, plan to match expert outcomes, even after making a mistake. These include (1) a world model and (2) a reward model. We carefully ablate the set of algorithmic and design decisions required to combine these and other components for stable and sample/interaction-efficient learning of recovery behavior without additional human corrections. Across a dozen visual manipulation tasks from three benchmarks, our approach SAILORconsistently out-performs state-of-the-art Diffusion Policies trained via BC on the same data. Furthermore, scaling up the amount of demonstrations used for BC by 5-10 still leaves a performance gap. We find that SAILORcan identify nuanced failures and is robust to reward hacking.
MyoChallenge 2024: ANew Benchmark for Physiological Dexterity and Agility in Bionic Humans
Recent advancements in bionic prosthetic technology offer transformative opportunities to restore mobility and functionality for individuals with missing limbs. Users of bionic limbs, or bionic humans, learn to seamlessly integrate prosthetic extensions into their motor repertoire, regaining critical motor abilities. The remarkable movement generalization and environmental adaptability demonstrated by these individuals highlight motor intelligence capabilities unmatched by current artificial intelligence systems. Addressing these limitations, MyoChallenge'24 at NeurIPS 2024 established a benchmark for human-robot coordination with an emphasis on joint control of both biological and mechanical limbs. The competition featured two distinct tracks: a manipulation task utilizing the myoMPL model, integrating a virtual biological arm and the Modular Prosthetic Limb (MPL) for a passover task; and a locomotion task using the novel myoOSL model, combining a bilateral virtual biological leg with a trans-femoral amputation and the Open Source Leg (OSL) to navigate varied terrains. Marking the third iteration of the MyoChallenge, the event attracted over 50 teams with more than 290 submissions all around the globe, with diverse participants ranging from independent researchers to high school students. The competition facilitated the development of several state-of-the-art control algorithms for bionic musculoskeletal systems, leveraging techniques such as imitation learning, muscle synergy, and model-based reinforcement learning that significantly surpassed our proposed baseline performance by a factor of 10. By providing the open-source simulation framework of MyoSuite, standardized tasks, and physiologically realistic models, MyoChallenge serves as a reproducible testbed and benchmark for bridging ML and biomechanics.
Vision Foundation Models as Effective Visual for Generation
In this work, we present a novel direction to build an image tokenizer directly on top of a frozen vision foundation model, which is a largely underexplored area. Specifically, we employ a frozen vision foundation model as the encoder of our tokenizer. To enhance its effectiveness, we introduce two key components: (1) a region-adaptive quantization framework that reduces redundancy in the pre-trained features on regular 2D grids, and (2) a semantic reconstruction objective that aligns the tokenizer's outputs with the foundation model's representations to preserve semantic fidelity. Based on these designs, our proposed image tokenizer, VFMTok, achieves substantial improvements in image reconstruction and generation quality, while also enhancing token efficiency. It further boosts autoregressive (AR) generation--achieving a gFID of 1.36 on ImageNet benchmarks, while accelerating model convergence by three times, and enabling high-fidelity class-conditional synthesis without the need for classifier-free guidance (CFG). The code is available at https://github.com/CVMI-Lab/VFMTok.
FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation
Residual connection has been extensively studied and widely applied at the model architecture level. However, its potential in the more challenging data-centric approaches remains unexplored. In this work, we introduce the concept of Data Residual Matching for the first time, leveraging data-level skip connections to facilitate data generation and mitigate data information vanishing. This approach maintains a balance between newly acquired knowledge through pixel space optimization and existing core local information identification within raw data modalities, specifically for the dataset distillation task. Furthermore, by incorporating training-time refinements, our method significantly improves computational efficiency, achieving superior performance while reducing training time and peak GPU memory usage by 50%. Consequently, the proposed method Fast and Accurate Data Residual Matching for Dataset Distillation (FADRM) establishes a new stateof-the-art, demonstrating substantial improvements over existing methods across multiple dataset benchmarks in both efficiency and effectiveness. For instance, with ResNet-18 as the student model and a 0.8% compression ratio on ImageNet-1K, the method achieves 48.4% test accuracy in single-model dataset distillation and 50.9% in multi-model dataset distillation, surpassing RDED by +6.4% and outperforming
EquiTabPFN: ATarget-Permutation Equivariant Prior Fitted Network
Recent foundational models for tabular data, such as TabPFN, excel at adapting to new tasks via in-context learning, but remain constrained to a fixed, pre-defined number of target dimensions--often necessitating costly ensembling strategies. We trace this constraint to a deeper architectural shortcoming: these models lack target equivariance, so that permuting target dimension orderings alters their predictions. This deficiency gives rise to an irreducible "equivariance gap," an error term that introduces instability in predictions. We eliminate this gap by designing a fully target-equivariant architecture--ensuring permutation invariance via equivariant encoders, decoders, and a bi-attention mechanism. Empirical evaluation on standard classification benchmarks shows that, on datasets with more classes than those seen during pre-training, our model matches or surpasses existing methods while incurring lower computational overhead.
Millions told to prepare NOW as Tropical Storm Warning is issued along US coast: 'Arthur is coming'
'Ringleader' of alleged UFC drone attack to kill Trump is unmasked as illegal migrant who was granted DACA stay under Obama Spy world panic as Tulsi Gabbard prepares to unleash bombshell file dumps on secret CIA'mind control' project and Dr. Fauci Watch horrifying drone video that follows woman's plunge to death after bungee team threw her from bridge without rope Horrific new videos blow Texas woman's mystery death wide open: Her agonizing'final gasp'... unthinkably vile corpse claims... and sick past of man who saw her last Taylor Swift's bottomless thirst for attention, her greed and sheer tackiness are now truly unbearable... this latest stunt has shown her true colors: MAUREEN CALLAHAN Kanye West's wife Bianca Censori raises eyebrows in plunging white lace lingerie as she photographs a nude model at Art Basel in Switzerland Trump says'fools who think I haven't been tough enough on Iran' are'jealous or stupid' after signing widely-criticised deal that includes giving Tehran $300billion Father keeps his cool as shouting man calls cops on him for taking his two young daughters into women's restroom All my friends are suddenly getting divorced. Mid-life wives share taboo sex confessions about why they really leave... including common position that made one hate her husband: JANA HOCKING Sensational REAL reason Jelly Roll is divorcing Bunnie XO: Insiders reveal'preacher's wife' bombshell that's the talk of Nashville... truth about legendary rocker cuckolding rumor... and G-string mishap Brooklyn Beckham is savaged by fans for yet another'classless' swipe at his estranged family as new DoorDash ad is branded a'giant PR mess' LIZ JONES: The cracks in Harry and Meghan's perfect facade have started to show. It's so obvious he's tiring of her tone-deaf approach... and I predict there's serious trouble in store Every emotional moment from the Gilgo Beach killer's sentencing: Rex Heuermann's shocking first words... and the chilling exchange that silenced the room Tropical Storm Arthur has formed in the Gulf, becoming the first named storm of the 2026 Atlantic hurricane season. The National Hurricane Center (NHC) announced Wednesday morning that Arthur had strengthened into a tropical storm with maximum sustained winds of 40mph. The storm was located about 40 miles northeast of Port O'Connor, Texas, and about 190 miles west-southwest of Lake Charles, Louisiana .
Normal-Abnormal Guided Generalist Anomaly Detection
Generalist Anomaly Detection (GAD) aims to train a unified model on an original domain that can detect anomalies in new target domains. Previous GAD methods primarily use only normal samples as references, overlooking the valuable information contained in anomalous samples that are often available in real-world scenarios. To address this limitation, we propose a more practical approach: normal-abnormalguided generalist anomaly detection, which leverages both normal and anomalous samples as references to guide anomaly detection across diverse domains. We introduce the Normal-Abnormal Generalist Learning (NAGL) framework, consisting of two key components: Residual Mining (RM) and Anomaly Feature Learning (AFL). RM extracts abnormal patterns from normal-abnormal reference residuals to establish transferable anomaly representations, while AFL adaptively learns anomaly features in query images through residual mapping to identify instance-aware anomalies.
These 6-foot Anker USB-C cables are under 4 each
When you purchase through links in our articles, we may earn a small commission. These 6-foot Anker USB-C cables are durable, convenient, and on sale for under $4 each. It's always a good time to upgrade your cable collection, especially since you can get 6ft 60W USB-C Anker cables for as little as $3.49 each . Woot is running a massive sale, offering two of these cables for $7.99 (down from $9.99) or four of them for $13.99 (down from $19.99). That's a tough deal to beat.
Non-Stationary Structural Causal Bandits
We study the problem of sequential decision-making in environments governed by evolving causal mechanisms. Prior work on structural causal bandits--formulations that integrate causal graphs into multi-armed bandit problems to guide intervention selection--has shown that leveraging the causal structure can reduce unnecessary interventions by identifying possibly-optimal minimal intervention sets (POMISs). However, such formulations fall short in dynamic settings where reward distributions may vary over time, due to their static--and thus myopic--nature focuses on immediate rewards and overlooks the long-term effects of interventions. In this work, we propose a non-stationary structural causal bandit framework that leverages temporal structural causal models to capture evolving dynamics over time. We characterize how interventions propagate over time by developing graphical tools and assumptions, which form the basis for identifying non-myopic intervention strategies. Within this framework, we devise POMIS+, which captures the existence of variables that contribute to maximizing both immediate and long-term rewards. Our framework provides a principled way to reason about temporally-aware interventions by explicitly modeling information propagation across time. Empirical results validate the effectiveness of our approach, demonstrating improved performance over myopic baselines.
EVAAA: AVirtual Environment Platform for Essential Variables in Autonomous and Adaptive Agents
Appendix A describes the Unity-based interface implemented in EVAAA, including an environment setup, prefab structures, and object instantiation. Appendix B provides a comprehensive introduction to Essential Variables (EVs), including their design, dynamics, and role in internal state regulation. Appendix C explains the implementation of the reward system and its connection to the balance of internal states. Appendix E outlines the modular configuration to generate EVAAA environments, along with the instructions for environment customization. Appendix F presents the structure and progression of naturalistic training environments. Appendix G describes the design of unseen experimental testbeds for evaluation. Appendix I provides analyses of agent behavior across training and test environments, including emergent behavioral patterns. All code and data are publicly available at: https://github.com/cocoanlab/evaaa A.1 Prefabs Environmental elements such as terrain, resources, obstacles, and predators are implemented as reusable and configurable Unity prefabs. Prefabs are grouped into Agents, Environment, and Materials. Each category includes reusable components for constructing and customizing interactive scenes: Agents (main agent and predators), Environment (terrain and containers), and Materials (varied textures and colors for visual distinction). This modular system enables rapid prototyping, task generation, condition randomization, and reproducible scene setup. Prefabs can be customized through the Unity Editor or programmatically at runtime, and reused across scenes without manual rebuilding.