Goto

Collaborating Authors

 prototype


TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

Neural Information Processing Systems

Time series analysis provides essential insights for real-world system dynamics and informs downstream decision-making, yet most existing methods often overlook the rich contextual signals present in auxiliary modalities. To bridge this gap, we introduce TimeXL, a multi-modal prediction framework that integrates a prototypebased time series encoder with three collaborating Large Language Models (LLMs) to deliver more accurate predictions and interpretable explanations. First, a multimodal prototype-based encoder processes both time series and textual inputs to generate preliminary forecasts alongside case-based rationales. These outputs then feed into a prediction LLM, which refines the forecasts by reasoning over the encoder's predictions and explanations. Next, a reflection LLM compares the predicted values against the ground truth, identifying textual inconsistencies or noise. Guided by this feedback, a refinement LLM iteratively enhances text quality and triggers encoder retraining. This closed-loop workflow--prediction, critique (reflect), and refinement--continuously boosts the framework's performance and interpretability. Empirical evaluations on four real-world datasets demonstrate that TimeXL achieves up to 8.9% improvement in AUC and produces human-centric, multi-modal explanations, highlighting the power of LLM-driven reasoning for time series prediction.


Policy Compatible Skill Incremental Learning via Lazy Learning Interface

Neural Information Processing Systems

Skill Incremental Learning (SIL) is the process by which an embodied agent expands and refines its skill set over time by leveraging experience gained through interaction with its environment or by the integration of additional data. SIL facilitates efficient acquisition of hierarchical policies grounded in reusable skills for downstream tasks. However, as the skill repertoire evolves, it can disrupt compatibility with existing skill-based policies, limiting their reusability and generalization. In this work, we propose SIL-C, a novel framework that ensures skill-policy compatibility, allowing improvements in incrementally learned skills to enhance the performance of downstream policies without requiring policy re-training or structural adaptation. SIL-C employs a bilateral lazy learning-based mapping technique to dynamically align the subtask space referenced by policies with the skill space decoded into agent behaviors. This enables each subtask, derived from the policy's decomposition of a complex task, to be executed by selecting an appropriate skill based on trajectory distribution similarity. We evaluate SIL-C across diverse SIL scenarios and demonstrate that it maintains compatibility between evolving skills and downstream policies while ensuring efficiency throughout the learning process.


Leaving No OODInstance Behind: Instance-Level OODFine-Tuning for Anomaly Segmentation

Neural Information Processing Systems

Out-of-distribution (OOD) fine-tuning has emerged as a promising approach for anomaly segmentation. Current OOD fine-tuning strategies typically employ global-level objectives, aiming to guide segmentation models to accurately predict a large number of anomaly pixels. However, these strategies often perform poorly on small anomalies. To address this issue, we propose an instance-level OOD fine-tuning framework, dubbed LNOIB (Leaving No OODInstance Behind). We start by theoretically analyzing why global-level objectives fail to segment small anomalies. Building on this analysis, we introduce a simple yet effective instancelevel objective. Moreover, we propose a feature separation objective to explicitly constrain the representations of anomalies, which are prone to be smoothed by their in-distribution (ID) surroundings. LNOIB integrates these objectives to enhance the segmentation of small anomalies and serves as a paradigm adaptable to existing OOD fine-tuning strategies, without introducing additional inference cost. Experimental results show that integrating LNOIB into various OOD fine-tuning strategies yields significant improvements, particularly in component-level results, highlighting its strength in comprehensive anomaly segmentation.


ProtoPairNet: Interpretable Regression through Prototypical Pair Reasoning

Neural Information Processing Systems

We present Prototypical Pair Network (ProtoPairNet), a novel interpretable architecture that combines deep learning with case-based reasoning to predict continuous targets. While prototype-based models have primarily addressed image classification with discrete outputs, extending these methods to continuous targets, such as regression, poses significant challenges. Existing architectures which rely heavily on one-to-one comparison with prototypes lack the directional information necessary for continuous predictions.


Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning

Neural Information Processing Systems

In this paper, we focus on Novel Class Discovery for Point Cloud Segmentation (3D-NCD), aiming to learn a model that can segment unlabeled (novel) 3D classes using only the supervision from labeled (base) 3D classes. The key to this task is to setup the exact correlations between the point representations and their base class labels, as well as the representation correlations between the points from base and novel classes. A coarse or statistical correlation learning may lead to the confusion in novel class inference.


Unbiased Prototype Consistency Learning for Multi-Modal and Multi-Task Object Re-Identification

Neural Information Processing Systems

In object re-identification (ReID) task, both cross-modal and multi-modal retrieval methods have achieved notable progress. However, existing approaches are designed for specific modality and category (person or vehicle) retrieval task, lacking generalizability to others. Acquiring multiple task-specific models would result in wasteful allocation of both training and deployment resources. To address the practical requirements for unified retrieval, we introduce Multi-Modal and MultiTask object ReID (M3T-ReID). The M3T-ReID task aims to utilize a unified model to simultaneously achieve retrieval tasks across different modalities and different categories. Specifically, to tackle the challenges of modality distibution divergence and category semantics discrepancy posed in M3T-ReID, we design a novel Unbiased Prototype Consistency Learning (UPCL) framework, which consists of two main modules: Unbiased Prototypes-guided Modality Enhancement (UPME) and Cluster Prototype Consistency Regularization (CPCR).


Stonehenge's secret SISTER: Archaeologists discover an ancient monument just three miles away that may have served as a 'prototype' for the famous stones

Daily Mail - Science & tech

Trump turns on the charm after extended'alpha' handshake with Macron and kisses for Brigitte at Palace of Versailles Sensational REAL reason Jelly Roll is divorcing Bunnie XO: Insiders reveal'preacher's wife' bombshell that's the talk of Nashville... truth about legendary rocker cuckolding rumor... and G-string mishap LIZ JONES: The cracks in Harry and Meghan's perfect facade have started to show. It's so obvious he's tiring of her tone-deaf approach... and I predict there's serious trouble in store Taylor Swift's bottomless thirst for attention, her greed and sheer tackiness are now truly unbearable... this latest stunt has shown her true colors: MAUREEN CALLAHAN NBA star's fiancee breaks her silence after friend, 26, mysteriously dropped dead at her luxury bachelorette party in St Barts Luxury fashion tycoon beloved by the stars hangs her head in shame as she's indicted for allegedly exploiting her workers and stealing $50k from their wages Jeff Bezos mercilessly mocked for taking'fake phone calls' when out with wife Lauren Sanchez Anguished family members flee court over sick details of Gilgo Beach murderer's kill room: Live updates'She has not been transparent... the damage has been done': How influencer Elle Darby'betrayed' thousands of young female fans...as insiders tell MOLLY CLAYTON how she cashed in As a divorced mother-of-three, cocaine was my little treat while my fellow middle-class friends had a few wines. What happened next was every family's worst nightmare... this is my warning to mums who'dabble' Desperate search for mom-of-three who hasn't been seen in three days as husband pleads for her return The shocking betrayal behind Jelly Roll's divorce from Bunnie XO is so utterly cruel... but have you yet spotted her revenge: JACQUELYNN POWERS Devastating supply crunch forces Apple to raise prices on iPhones and other devices, calling the move'unavoidable' Jeff's Dream Team: Bezos recruits world's top architects to build most expensive mega mansion on Billionaire Bunker island The Ring star Daveigh Chase's friends searched for her on LA's Skid Row in months before her shock death at 35 Watch horrifying drone video that follows woman's plunge to death after bungee team threw her from bridge without rope Stonehenge's secret SISTER: Archaeologists discover an ancient monument just three miles away that may have served as a'prototype' for the famous stones Archaeologists have discovered a secret sister monument to Stonehenge that might have served as a'prototype' for the famous stones. This ancient site is just three miles away from Stonehenge itself, located in the village of Bulford, Wiltshire. Consisting of two wooden poles placed 400 feet (120 metres) apart, this long-lost monument might appear rather basic at first glance.


MetaSlot: Break Through the Fixed Number of Slots in Object-Centric Learning

Neural Information Processing Systems

Learning object-level, structured representations is widely regarded as a key to better generalization in vision and underpins the design of next-generation Pre-trained Vision Models (PVMs). Mainstream Object-Centric Learning (OCL) methods adopt Slot Attention or its variants to iteratively aggregate objects' super-pixels into a fixed set of query feature vectors, termed slots. However, their reliance on a static slot count leads to an object being represented as multiple parts when the number of objects varies. We introduce MetaSlot, a plug-and-play Slot Attention variant that adapts to variable object counts. MetaSlot (i) maintains a codebook that holds prototypes of objects in a dataset by vector-quantizing the resulting slot representations; (ii) removes duplicate slots from the traditionally aggregated slots by quantizing them with the codebook; and (iii) injects progressively weaker noise into the Slot Attention iterations to accelerate and stabilize the aggregation. MetaSlot is a general Slot Attention variant that can be seamlessly integrated into existing OCL architectures. Across multiple public datasets and tasks-including object discovery and recognition-models equipped with MetaSlot achieve significant performance gains and markedly interpretable slot representations, compared with existing Slot Attention variants.


HYPERION: Fine-Grained Hypersphere Alignment for Robust Federated Graph Learning

Neural Information Processing Systems

Robust Federated Graph Learning (FGL) provides an effective decentralized framework for training Graph Neural Networks (GNNs) in noisy-label environments. However, the subtlety of noise during training presents formidable obstacles for developing robust FGL systems. Previous robust FL approaches neither adequately constrain edge-mediated error propagation nor account for intra-class topological differences. At the client level, we innovatively demonstrate that hyperspherical embedding can effectively capture graph structures in a fine-grained manner. Correspondingly, our method effectively addresses the aforementioned issues through fine-grained hypersphere alignment. Moreover, we uncover undetected noise arising from localized perspective constraints and propose the geometricaware hyperspherical purification module at the server level. Combining both level strategies, we present our robust FGL framework, HYPERION, which operates all components within a unified hyperspherical space. HYPERION demonstrates remarkable robustness across multiple datasets, for instance, achieving a 29.7% F1-macro score with 50%-pair noise on Cora.


DAA: Amplifying Unknown Discrepancy for Test-Time Discovery

Neural Information Processing Systems

Test-Time Discovery (TTD) addresses the critical challenge of identifying and adapting to novel classes during inference while maintaining performance on known classes, which is a capability essential for dynamic real-world environments such as healthcare and autonomous driving. Recent TTD methods adopt training-free, memory-based strategies but rely on frozen models and static representations, resulting in poor generalization. In this paper, we propose a DiscrepancyAmplifying Adapter (DAA), a trainable module that enables real-time adaptation by amplifying feature-level discrepancies between known and unknown classes. During training, DAA is optimized using simulated unknowns and a novel warmup strategy to enhance its discriminative capacity. To ensure continual adaptation at test time, we introduce a Short-Term Memory Renewal (STMR) mechanism, which maintains a queue-based memory for unknown classes and selectively refreshes prototypes using recent, reliable samples. DAA is further updated through self-supervised learning, promoting knowledge retention for known classes while improving discrimination of emerging categories. Extensive experiments show that our method maintains high adaptability and stability, and significantly improves novel class discovery performance.