AITopics | bayesian meta-learning

6cdb2cbb2083477cca5243843d6dad06-Paper-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 17:56:49 GMT

artificial intelligence, likelihood, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > China (0.04)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification

Neural Information Processing SystemsDec-25-2025, 22:47:11 GMT

Meta-learning has demonstrated promising results in few-shot classification (FSC) by learning to solve new problems using prior knowledge. Bayesian methods are effective at characterizing uncertainty in FSC, which is crucial in high-risk fields. In this context, the logistic-softmax likelihood is often employed as an alternative to the softmax likelihood in multi-class Gaussian process classification due to its conditional conjugacy property. However, the theoretical property of logistic-softmax is not clear and previous research indicated that the inherent uncertainty of logistic-softmax leads to suboptimal performance. To mitigate these issues, we revisit and redesign the logistic-softmax likelihood, which enables control of the \textit{a priori} confidence level through a temperature parameter. Furthermore, we theoretically and empirically show that softmax can be viewed as a special case of logistic-softmax and logistic-softmax induces a larger family of data distribution than softmax. Utilizing modified logistic-softmax, we integrate the data augmentation technique into the deep kernel based Gaussian process meta-learning framework, and derive an analytical mean-field approximation for task-specific updates. Our approach yields well-calibrated uncertainty estimates and achieves comparable or superior results on standard benchmark datasets.

bayesian meta-learning, name change, revisiting logistic-softmax likelihood, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Neural Information Processing SystemsDec-24-2025, 12:26:17 GMT

Recently, different machine learning methods have been introduced to tackle the challenging few-shot learning scenario that is, learning from a small labeled dataset related to a specific task. Common approaches have taken the form of meta-learning: learning to learn on the new problem given the old. Following the recognition that meta-learning is implementing learning in a multi-level model, we present a Bayesian treatment for the meta-learning inner loop through the use of deep kernels. As a result we can learn a kernel that transfers to new tasks; we call this Deep Kernel Transfer (DKT). This approach has many advantages: is straightforward to implement as a single optimizer, provides uncertainty quantification, and does not require estimation of task-specific parameters. We empirically demonstrate that DKT outperforms several state-of-the-art algorithms in few-shot classification, and is the state of the art for cross-domain adaptation and regression. We conclude that complex meta-learning routines can be replaced by a simpler Bayesian model without loss of accuracy.

bayesian meta-learning, few-shot, name change, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Revisiting Logistic-softmax Likelihood in Bayesian Meta-learning for Few-shot Classification

Neural Information Processing SystemsOct-8-2025, 20:58:13 GMT

Furthermore, we theoretically and empirically show that softmax can be viewed as a special case of logistic-softmax and logistic-softmax induces a larger family of data distribution than softmax.

artificial intelligence, likelihood, machine learning, (16 more...)

Neural Information Processing Systems

Country: Asia > China (0.04)

Genre: Research Report > New Finding (0.93)

Add feedback

Review for NeurIPS paper: Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Neural Information Processing SystemsFeb-4-2025, 22:26:09 GMT

Additional Feedback: I enjoyed reading this paper. Is using deep kernel learning a contribution of the paper? I believe the proposed method is applicable for any Gaussian process-style models. Where deep kernels help is to work with high-dimensional inputs such as images. Vanilla Gaussian processes (GP) are more suitable for a few data points due to the cubic computational complexity whereas deep networks are more suitable for big-data settings.

bayesian meta-learning, deep kernel, neurips paper, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Review for NeurIPS paper: Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Neural Information Processing SystemsFeb-4-2025, 22:26:01 GMT

The paper provides a nice adaptation of deep kernel learning to the few-shot setting, with promising performance over key deep learning baselines. Reviewers are united in their support for the work. Please carefully consider reviewer comments (and post rebuttal updates) in preparing final revisions.

bayesian meta-learning, deep kernel, neurips paper, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification

Neural Information Processing SystemsJan-19-2025, 03:26:43 GMT

Meta-learning has demonstrated promising results in few-shot classification (FSC) by learning to solve new problems using prior knowledge. Bayesian methods are effective at characterizing uncertainty in FSC, which is crucial in high-risk fields. In this context, the logistic-softmax likelihood is often employed as an alternative to the softmax likelihood in multi-class Gaussian process classification due to its conditional conjugacy property. However, the theoretical property of logistic-softmax is not clear and previous research indicated that the inherent uncertainty of logistic-softmax leads to suboptimal performance. To mitigate these issues, we revisit and redesign the logistic-softmax likelihood, which enables control of the \textit{a priori} confidence level through a temperature parameter. Furthermore, we theoretically and empirically show that softmax can be viewed as a special case of logistic-softmax and logistic-softmax induces a larger family of data distribution than softmax.

bayesian meta-learning, few-shot classification, revisiting logistic-softmax likelihood

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Neural Information Processing SystemsOct-11-2024, 05:03:19 GMT

Recently, different machine learning methods have been introduced to tackle the challenging few-shot learning scenario that is, learning from a small labeled dataset related to a specific task. Common approaches have taken the form of meta-learning: learning to learn on the new problem given the old. Following the recognition that meta-learning is implementing learning in a multi-level model, we present a Bayesian treatment for the meta-learning inner loop through the use of deep kernels. As a result we can learn a kernel that transfers to new tasks; we call this Deep Kernel Transfer (DKT). This approach has many advantages: is straightforward to implement as a single optimizer, provides uncertainty quantification, and does not require estimation of task-specific parameters.

bayesian meta-learning, deep kernel, few-shot

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Scalable Bayesian Meta-Learning through Generalized Implicit Gradients

Zhang, Yilang, Li, Bingcong, Gao, Shijian, Giannakis, Georgios B.

arXiv.org Artificial IntelligenceMar-30-2023

Meta-learning owns unique effectiveness and swiftness in tackling emerging tasks with limited data. Its broad applicability is revealed by viewing it as a bi-level optimization problem. The resultant algorithmic viewpoint however, faces scalability issues when the inner-level optimization relies on gradient-based iterations. Implicit differentiation has been considered to alleviate this challenge, but it is restricted to an isotropic Gaussian prior, and only favors deterministic meta-learning approaches. This work markedly mitigates the scalability bottleneck by cross-fertilizing the benefits of implicit differentiation to probabilistic Bayesian meta-learning. The novel implicit Bayesian meta-learning (iBaML) method not only broadens the scope of learnable priors, but also quantifies the associated uncertainty. Furthermore, the ultimate complexity is well controlled regardless of the inner-level optimization trajectory. Analytical error bounds are established to demonstrate the precision and efficiency of the generalized implicit gradient over the explicit one. Extensive numerical tests are also carried out to empirically validate the performance of the proposed method.

complexity, diag, ibaml, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1609/aaai.v37i9.26337

2303.17768

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
(2 more...)

Add feedback

Bayesian Active Meta-Learning for Few Pilot Demodulation and Equalization

Cohen, Kfir M., Park, Sangwoo, Simeone, Osvaldo, Shamai, Shlomo

arXiv.org Artificial IntelligenceDec-5-2022

Two of the main principles underlying the life cycle of an artificial intelligence (AI) module in communication networks are adaptation and monitoring. Adaptation refers to the need to adjust the operation of an AI module depending on the current conditions; while monitoring requires measures of the reliability of an AI module's decisions. Classical frequentist learning methods for the design of AI modules fall short on both counts of adaptation and monitoring, catering to one-off training and providing overconfident decisions. This paper proposes a solution to address both challenges by integrating meta-learning with Bayesian learning. As a specific use case, the problems of demodulation and equalization over a fading channel based on the availability of few pilots are studied. Meta-learning processes pilot information from multiple frames in order to extract useful shared properties of effective demodulators across frames. The resulting trained demodulators are demonstrated, via experiments, to offer better calibrated soft decisions, at the computational cost of running an ensemble of networks at run time. The capacity to quantify uncertainty in the model parameter space is further leveraged by extending Bayesian meta-learning to an active setting. In it, the designer can select in a sequential fashion channel conditions under which to generate data for meta-learning from a channel simulator. Bayesian active meta-learning is seen in experiments to significantly reduce the number of frames required to obtain efficient adaptation procedure for new frames.

artificial intelligence, bayesian inference, machine learning, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TSP.2022.3220035

2108.00785

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(15 more...)

Genre: Research Report (1.00)

Industry:

Education (0.67)
Government (0.46)

Add feedback

Filters

Collaborating Authors

bayesian meta-learning

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

6cdb2cbb2083477cca5243843d6dad06-Paper-Conference.pdf

Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification

Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Revisiting Logistic-softmax Likelihood in Bayesian Meta-learning for Few-shot Classification

Review for NeurIPS paper: Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Review for NeurIPS paper: Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification

Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Scalable Bayesian Meta-Learning through Generalized Implicit Gradients

Bayesian Active Meta-Learning for Few Pilot Demodulation and Equalization