Goto

Collaborating Authors

 Technology


Inspired by Ukraine, and worried by China: Taiwan teaches its citizens how to fly drones

The Guardian

I n a small, crowded room in Taipei, Pan Chien-chin is trying to keep a drone hovering steadily. Imagining himself flying a plane, he gently nudges controller joysticks to guide the insect-like device as it hums through the air. Cheers break out as Pan, who has never flown a drone before, steers it around a rectangular course marked by traffic cones without crashing. Around him are about two dozen fellow trainees, all signed up for the same course: Taiwan's first civil defence drone training programme. "The war in Ukraine has really changed how drones are used," says Pan, 48, a food company worker. "It's like giving myself another skill, something I can use if it's ever needed one day," he adds.


Frรฉchet Geodesic Boosting

Neural Information Processing Systems

Gradient boosting has become a cornerstone of machine learning, enabling base learners such as decision trees to achieve exceptional predictive performance. While existing algorithms primarily handle scalar or Euclidean outputs, increasingly prevalent complex-structured data, such as distributions, networks, and manifoldvalued outputs, present challenges for traditional methods. Such non-Euclidean data lack algebraic structures such as addition, subtraction, or scalar multiplication required by standard gradient boosting frameworks. To address these challenges, we introduce Frรฉchet geodesic boosting (FGBoost), a novel approach tailored for outputs residing in geodesic metric spaces. FGBoost leverages geodesics as proxies for residuals and constructs ensembles in a way that respects the intrinsic geometry of the output space. Through theoretical analysis, extensive simulations, and realworld applications, we demonstrate the strong performance and adaptability of FGBoost, showcasing its potential for modeling complex data.


Probabilistic Stability Guarantees for Feature Attributions

Neural Information Processing Systems

Stability guarantees have emerged as a principled way to evaluate feature attributions, but existing certification methods rely on heavily smoothed classifiers and often produce conservative guarantees. To address these limitations, we introduce soft stability and propose a simple, model-agnostic, sample-efficient stability certification algorithm (SCA) that yields non-trivial and interpretable guarantees for any attribution method. Moreover, we show that mild smoothing achieves a more favorable trade-off between accuracy and stability, avoiding the aggressive compromises made in prior certification methods. To explain this behavior, we use Boolean function analysis to derive a novel characterization of stability under smoothing. We evaluate SCA on vision and language tasks and demonstrate the effectiveness of soft stability in measuring the robustness of explanation methods.


CLIPTTA: Robust Contrastive Vision-Language Test-Time Adaptation

Neural Information Processing Systems

Vision-language models (VLMs) like CLIP exhibit strong zero-shot capabilities but often fail to generalize under distribution shifts. Test-time adaptation (TTA) allows models to update at inference time without labeled data, typically via entropy minimization. However, this objective is fundamentally misaligned with the contrastive image-text training of VLMs, limiting adaptation performance and introducing failure modes such as pseudo-label drift and class collapse. We propose CLIPTTA, a new gradient-based TTA method for vision-language models that leverages a soft contrastive loss aligned with CLIP's pre-training objective. We provide a theoretical analysis of CLIPTTA's gradients, showing how its batchaware design mitigates the risk of collapse. We further extend CLIPTTA to the open-set setting, where both in-distribution (ID) and out-of-distribution (OOD) samples are encountered, using an Outlier Contrastive Exposure (OCE) loss to improve OOD detection. Evaluated on 75 datasets spanning diverse distribution shifts, CLIPTTA consistently outperforms entropy-based objectives and is highly competitive with state-of-the-art TTA methods, outperforming them on a large number of datasets and exhibiting more stable performance across diverse shifts. Source code is available at: CLIPTTARepository.


Renowned scientist Neil deGrasse Tyson tells US government to 'Show the alien!' after latest UFO disclosure

Daily Mail - Science & tech

Trump turns on the charm after extended'alpha' handshake with Macron and kisses for Brigitte at Palace of Versailles Sensational REAL reason Jelly Roll is divorcing Bunnie XO: Insiders reveal'preacher's wife' bombshell that's the talk of Nashville... truth about legendary rocker cuckolding rumor... and G-string mishap LIZ JONES: The cracks in Harry and Meghan's perfect facade have started to show. It's so obvious he's tiring of her tone-deaf approach... and I predict there's serious trouble in store Taylor Swift's bottomless thirst for attention, her greed and sheer tackiness are now truly unbearable... this latest stunt has shown her true colors: MAUREEN CALLAHAN NBA star's fiancee breaks her silence after friend, 26, mysteriously dropped dead at her luxury bachelorette party in St Barts Luxury fashion tycoon beloved by the stars hangs her head in shame as she's indicted for allegedly exploiting her workers and stealing $50k from their wages Jeff Bezos mercilessly mocked for taking'fake phone calls' when out with wife Lauren Sanchez Anguished family members flee court over sick details of Gilgo Beach murderer's kill room: Live updates'She has not been transparent... the damage has been done': How influencer Elle Darby'betrayed' thousands of young female fans...as insiders tell MOLLY CLAYTON how she cashed in As a divorced mother-of-three, cocaine was my little treat while my fellow middle-class friends had a few wines. What happened next was every family's worst nightmare... this is my warning to mums who'dabble' Desperate search for mom-of-three who hasn't been seen in three days as husband pleads for her return The shocking betrayal behind Jelly Roll's divorce from Bunnie XO is so utterly cruel... but have you yet spotted her revenge: JACQUELYNN POWERS Devastating supply crunch forces Apple to raise prices on iPhones and other devices, calling the move'unavoidable' Jeff's Dream Team: Bezos recruits world's top architects to build most expensive mega mansion on Billionaire Bunker island The Ring star Daveigh Chase's friends searched for her on LA's Skid Row in months before her shock death at 35 Watch horrifying drone video that follows woman's plunge to death after bungee team threw her from bridge without rope Renowned scientist Neil deGrasse Tyson tells US government to'Show the alien!' after latest UFO disclosure Astrophysicist Neil deGrasse Tyson wants the US government to'Show the alien!' after the latest UFO disclosure from the Trump administration . Tyson, 67, joined The Fox News Rundown on Monday, explaining how the overwhelming amount of irrefutable evidence should finally be topped off with a picture of an extraterrestrial being. 'Is it too much to ask at this point for them to just show the alien?


Watermarking Autoregressive Image Generation

Neural Information Processing Systems

Watermarking the outputs of generative models has emerged as a promising approach for tracking their provenance. Despite significant interest in autoregressive image generation models and their potential for misuse, no prior work has attempted the first such to watermark approach their by adapting outputs language at the tok model en level.


Equivariance by Contrast: Identifiable Equivariant Embeddings from Unlabeled Finite Group Actions

Neural Information Processing Systems

We propose Equivariance by Contrast (EbC) to learn equivariant embeddings from observation pairs (y,g y), where g is drawn from a finite group acting on the data. Our method jointly learns a latent space and a group representation in which group actions correspond to invertible linear maps--without relying on group-specific inductive biases. We validate our approach on the infinite dSprites dataset with structured transformations defined by the finite group G:= (Rm Zn Zn), combining discrete rotations and periodic translations. The resulting embeddings exhibit high-fidelity equivariance, with group operations faithfully reproduced in latent space.


Japan's defense chief challenges China's military spending data

The Japan Times

Drawing a contrast with China, Koizumi has said that Japan would take a transparent approach to investing in new methods of warfare like drones and artificial intelligence.


Value-Guided Decision Transformer: AUnified Reinforcement Learning Framework for Online and Offline Settings

Neural Information Processing Systems

The Conditional Sequence Modeling (CSM) paradigm, benefiting from the transformer's powerful distribution modeling capabilities, has demonstrated considerable promise in Reinforcement Learning (RL) tasks. However, much of the work has focused on applying CSM to single online or offline settings, with the general architecture rarely explored. Additionally, existing methods primarily focus on deterministic trajectory modeling, overlooking the randomness of state transitions and the diversity of future trajectory distributions. Fortunately, value-based methods offer a viable solution for CSM, further bridging the potential gap between offline and online RL. In this paper, we propose Value-Guided Decision Transformer (VDT), which leverages value functions to perform advantage-weighting and behavior regularization on the Decision Transformer (DT), guiding the policy toward upper-bound optimal decisions during the offline training phase.


Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments

Neural Information Processing Systems

Target speaker extraction focuses on isolating a specific speaker's voice from an audio mixture containing multiple speakers. To provide information about the target speaker's identity, prior works have utilized clean audio samples as conditioning inputs. However, such clean audio examples are not always readily available. For instance, obtaining a clean recording of a stranger's voice at a cocktail party without leaving the noisy environment is generally infeasible. Limited prior research has explored extracting the target speaker's characteristics from noisy enrollments, which may contain overlapping speech from interfering speakers. In this work, we explore a novel enrollment strategy that encodes target speaker information from the noisy enrollment by comparing segments where the target speaker is talking (Positive Enrollments) with segments where the target speaker is silent (Negative Enrollments). Experiments show the effectiveness of our model architecture, which achieves over 2.1 dB higher SI-SNRi compared to prior works in extracting the monaural speech from the mixture of two speakers. Additionally, the proposed two-stage training strategy accelerates convergence, reducing the number of optimization steps required to reach 3 dBSNR by 60%. Overall, our method achieves state-of-the-art performance in the monaural target speaker extraction conditioned on noisy enrollments.