AITopics | support sample

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
North America > United States > California (0.04)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Neural Information Processing SystemsFeb-7-2026, 08:16:20 GMT

A Proof of Theorem 2 Lemma 1 (McDiarmid)

Lemma 3 are proposed in [20] to prove Theorem 3. Lemma 2 As shown in Eq. (6), the generalization bound of meta-algorithms with LOO training relies on both Stability of Inner-T ask Algorithm. Prototypical networks find the prototype (mean vector) of each class first and then classifying the query into the nearest prototype's class in the embedding space. Based on the definition, the following gives the derivation of the stability parameter β . For the expectation w.r.t. the support set Therefore, we obtain the upper bound of O (1/m) for the hypothesis stability β . Based on the above results, we obtain Theorem 5. Theorem 5

artificial intelligence, machine learning, prototypical network, (17 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

Neural Information Processing SystemsDec-23-2025, 23:11:35 GMT

Feature-Proxy Transformer for Few-Shot Segmentation

Few-shot segmentation~(FSS) aims at performing semantic segmentation on novel classes given a few annotated support samples. With a rethink of recent advances, we find that the current FSS framework has deviated far from the supervised segmentation framework: Given the deep features, FSS methods typically use an intricate decoder to perform sophisticated pixel-wise matching, while the supervised segmentation methods use a simple linear classification head. Due to the intricacy of the decoder and its matching pipeline, it is not easy to follow such an FSS framework. This paper revives the straightforward framework of ``feature extractor $+$ linear classification head'' and proposes a novel Feature-Proxy Transformer (FPTrans) method, in which the ``proxy'' is the vector representing a semantic class in the linear classification head. FPTrans has two keypoints for learning discriminative features and representative proxies: 1) To better utilize the limited support samples, the feature extractor makes the query interact with the support features from bottom to top layers using a novel prompting strategy.

feature-proxy transformer, linear classification head, name change, (8 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.39)

Pantazis, Georgios, Mignoni, Nicola, Carli, Raffaele, Dotoli, Mariagrazia, Grammatico, Sergio

Adversarially and Distributionally Robust Virtual Energy Storage Systems via the Scenario Approach

arXiv.org Artificial IntelligenceNov-13-2025

We propose an optimization model where a parking lot manager (PLM) can aggregate parked EV batteries to provide virtual energy storage services that are provably robust under uncertain EV departures and state-of-charge caps. Our formulation yields a data-driven convex optimization problem where a prosumer community agrees on a contract with the PLM for the provision of storage services over a finite horizon. Leveraging recent results in the scenario approach, we certify out-of-sample constraint safety. Furthermore, we enable a tunable profit-risk trade-off through scenario relaxation and extend our model to account for robustness to adversarial perturbations and distributional shifts over Wasserstein-based ambiguity sets. All the approaches are accompanied by tight finite-sample certificates. Numerical studies demonstrate the out-of-sample and out-of-distribution constraint satisfaction of our proposed model compared to the developed theoretical guarantees, showing their effectiveness and potential in robust and efficient virtual energy services.

artificial intelligence, constraint, optimization problem, (17 more...)

2511.09427

Country:

Europe > Netherlands > South Holland > Delft (0.04)
Europe > Italy > Apulia > Bari (0.04)

Genre: Research Report (0.50)

Industry:

Transportation > Ground > Road (1.00)
Transportation > Electric Vehicle (1.00)
Automobiles & Trucks (1.00)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.87)

arXiv.org Artificial IntelligenceOct-13-2025

Provably Robust Adaptation for Language-Empowered Foundation Models

Lai, Yuni, Xue, Xiaoyu, Shen, Linghui, Wu, Yulun, Li, Gaolei, Guo, Song, Zhou, Kai, Xiao, Bin

Language-empowered foundation models (LeFMs), such as CLIP and GraphCLIP, have transformed multimodal learning by aligning visual (or graph) features with textual representations, enabling powerful downstream capabilities like few-shot learning. However, the reliance on small, task-specific support datasets collected in open environments exposes these models to poisoning attacks, where adversaries manipulate the support samples to degrade performance. Existing defenses rely on empirical strategies, which lack formal guarantees and remain vulnerable to unseen and adaptive attacks. Certified robustness offers provable guarantees but has been largely unexplored for few-shot classifiers based on LeFMs. This study seeks to fill these critical gaps by proposing the first provably robust few-shot classifier that is tailored for LeFMs. We term our model Language-empowered Few-shot Certification (\textbf{LeFCert}). It integrates both textual and feature embeddings with an adaptive blending mechanism. To achieve provable robustness, we propose a twofold trimmed mean prototype and derive provable upper and lower bounds for classification scores, enabling certification under worst-case poisoning scenarios. To further enhance the performance, we extend LeFCert with two variants by considering a more realistic and tighter attack budget: LeFCert-L incorporates randomized smoothing to provide Lipschitz continuity and derive robustness under dual budget constraints, and LeFCert-C provides collective certification for scenarios where attackers distribute a shared poisoning budget across multiple samples. Experiments demonstrate that LeFCert achieves state-of-the-art performance, significantly improving both clean and certified accuracy compared to existing baselines. Despite its advanced robustness mechanisms, LeFCert is computationally efficient, making it practical for real-world applications.

artificial intelligence, machine learning, natural language, (19 more...)

2510.08659

Country:

Asia > China > Hong Kong (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Neural Information Processing SystemsOct-10-2025, 23:44:34 GMT

bbb7506579431a85861a05fff048d3e1-Paper-Conference.pdf

information, machine learning, natural language, (18 more...)

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
North America > United States > California (0.04)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Neural Information Processing SystemsOct-1-2025, 22:22:52 GMT

0415740eaa4d9decbc8da001d3fd805f-Supplemental.pdf

artificial intelligence, machine learning, prototypical network, (17 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

arXiv.org Artificial IntelligenceSep-29-2025

ECHO: Toward Contextual Seq2Seq Paradigms in Large EEG Models

Liu, Chenyu, Deng, Yuqiu, Liu, Tianyu, Zhou, Jinan, Zhou, Xinliang, Jia, Ziyu, Ding, Yi

Electroencephalography (EEG), with its broad range of applications, necessitates models that can generalize effectively across various tasks and datasets. Large EEG Models (LEMs) address this by pretraining encoder-centric architectures on large-scale unlabeled data to extract universal representations. While effective, these models lack decoders of comparable capacity, limiting the full utilization of the learned features. To address this issue, we introduce ECHO, a novel decoder-centric LEM paradigm that reformulates EEG modeling as sequence-to-sequence learning. ECHO captures layered relationships among signals, labels, and tasks within sequence space, while incorporating discrete support samples to construct contextual cues. This design equips ECHO with in-context learning, enabling dynamic adaptation to heterogeneous tasks without parameter updates. Extensive experiments across multiple datasets demonstrate that, even with basic model components, ECHO consistently outperforms state-of-the-art single-task LEMs in multi-task settings, showing superior generalization and adaptability.

artificial intelligence, machine learning, natural language, (20 more...)

2509.22556

Country:

Europe > Austria > Styria > Graz (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)
North America > United States (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Neural Information Processing SystemsAug-16-2025, 14:26:50 GMT

Alleviating the Sample Selection Bias in Few-shot Learning by Removing Projection to the Centroid

Despite the emergence of a number of few-shot learning methods, the sample selection bias problem, i.e., the sensitivity to the limited amount of support data, has not been well understood.

artificial intelligence, machine learning, task centroid, (16 more...)

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > China > Zhejiang Province (0.04)
Asia > China > Heilongjiang Province > Harbin (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Zhuang, Xuanyu, Peeters, Geoffroy, Richard, Gaël

Episode-specific Fine-tuning for Metric-based Few-shot Learners with Optimization-based Training

arXiv.org Artificial IntelligenceJun-24-2025

In few-shot classification tasks (so-called episodes), a small set of labeled support samples is provided during inference to aid the classification of unlabeled query samples. Metric-based models typically operate by computing similarities between query and support embeddings within a learned metric space, followed by nearest-neighbor classification. However, these labeled support samples are often underutilized--they are only used for similarity comparison, despite their potential to fine-tune and adapt the metric space itself to the classes in the current episode. To address this, we propose a series of simple yet effective episode-specific, during-inference fine-tuning methods for metric-based models, including Rotational Division Fine-Tuning (RDFT) and its two variants, Iterative Division Fine-Tuning (IDFT) and Augmented Division Fine-Tuning (ADFT). These methods construct pseudo support-query pairs from the given support set to enable fine-tuning even for non-parametric models. Nevertheless, the severely limited amount of data in each task poses a substantial risk of overfitting when applying such fine-tuning strategies. To mitigate this, we further propose to train the metric-based model within an optimization-based meta-learning framework. With the combined efforts of episode-specific fine-tuning and optimization-based meta-training, metric-based models are equipped with the ability to rapidly adapt to the limited support samples during inference while avoiding overfitting. We validate our approach on three audio datasets from diverse domains, namely ESC-50 (environmental sounds), Speech Commands V2 (spoken keywords), and Medley-solos-DB (musical instrument). Experimental results demonstrate that our approach consistently improves performance for all evaluated metric-based models (especially for attention-based models) and generalizes well across different audio domains.

artificial intelligence, deep learning, machine learning, (17 more...)