Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection (Bo Han)

Neural Information Processing Systems

Out-of-distribution (OOD) detection is crucial for deploying reliable machine learning models in open-world applications. Recent advances in CLIP-based OOD detection have shown promising results by regularizing prompt tuning with OOD features extracted from ID data. However, the irrelevant context mined from ID data can be spurious due to inaccurate foreground-background decomposition, which limits OOD detection performance. In this work, we propose a novel framework, Self-Calibrated Tuning (SCT), to mitigate this problem and enable effective OOD detection with only the given few-shot ID data. Specifically, SCT introduces modulating factors on the two components of the original learning objective. During training, it adaptively shifts the optimization focus between the two tasks according to the prediction uncertainty of each sample, thereby calibrating the influence of the OOD regularization; the scheme is compatible with many prompt-tuning-based OOD detection methods. Extensive experiments and analyses characterize and demonstrate the effectiveness of the proposed SCT. The code is publicly available at: https://github.com/tmlr-group/SCT.
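
A minimal sketch of the idea of an uncertainty-modulated joint objective, assuming a PyTorch-style setup in which an ID classification loss and an OOD regularization term are combined; the function names and the specific form of the modulating factors are illustrative assumptions, not the authors' exact formulation.

    import torch
    import torch.nn.functional as F

    def self_calibrated_loss(logits_id, labels, logits_ood, lam=1.0):
        """Illustrative uncertainty-modulated combination of an ID classification
        loss and an OOD regularization loss (not the paper's exact objective).
        Assumes logits_ood is aligned one-to-one with the ID batch, e.g. OOD
        features mined from the same images."""
        # Per-sample ID classification loss (prompt-tuning cross-entropy).
        ce = F.cross_entropy(logits_id, labels, reduction="none")

        # Prediction uncertainty of each ID sample, measured here by the
        # normalized entropy of the softmax prediction.
        probs = logits_id.softmax(dim=-1)
        entropy = -(probs * probs.clamp_min(1e-12).log()).sum(dim=-1)
        uncertainty = entropy / torch.log(torch.tensor(float(logits_id.size(-1))))

        # OOD regularization: push surrogate-OOD logits toward a uniform
        # (maximum-entropy) prediction, again computed per sample.
        log_probs_ood = logits_ood.log_softmax(dim=-1)
        uniform = torch.full_like(log_probs_ood, 1.0 / logits_ood.size(-1))
        reg = F.kl_div(log_probs_ood, uniform, reduction="none").sum(dim=-1)

        # Modulating factors (illustrative choice): confident samples emphasize
        # the ID task, uncertain samples down-weight the OOD regularization.
        w_id = 1.0 + uncertainty
        w_ood = 1.0 - uncertainty

        return (w_id * ce).mean() + lam * (w_ood * reg).mean()
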



BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts (Simon Guo)

Neural Information Processing Systems

The Mixture of Experts (MoE) framework has become a popular architecture for large language models due to its superior performance over dense models. However, training MoEs from scratch at large scale is prohibitively expensive. Existing methods mitigate this by pre-training multiple dense expert models independently and using them to initialize an MoE: the dense models' feed-forward network (FFN) weights initialize the MoE's experts, while the remaining parameters are merged. However, this limits the reuse of dense model parameters to only the FFN layers, thereby constraining the benefits of "upcycling" these models into MoEs.
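
A rough sketch of the FFN-only upcycling scheme described above, assuming each dense model exposes its parameters as a state dict in which FFN weights are identifiable by a name pattern; the helper name and the simple-averaging merge are assumptions for illustration, not the paper's exact procedure.

    import torch

    def upcycle_dense_to_moe(dense_state_dicts, ffn_key="mlp"):
        """Illustrative FFN-only upcycling: each dense model's FFN weights become
        one expert; all remaining parameters are merged by simple averaging."""
        experts = []   # one FFN-only state dict per expert
        shared = {}    # averaged non-FFN parameters

        for sd in dense_state_dicts:
            experts.append({k: v.clone() for k, v in sd.items() if ffn_key in k})

        non_ffn_keys = [k for k in dense_state_dicts[0] if ffn_key not in k]
        for k in non_ffn_keys:
            shared[k] = torch.stack([sd[k] for sd in dense_state_dicts]).mean(dim=0)

        return experts, shared
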



Bias and variance of the Bayesian-mean decoder

Neural Information Processing Systems

Perception, in theoretical neuroscience, has been modeled as the encoding of external stimuli into internal signals, which are then decoded. The Bayesian mean is an important decoder, as it is optimal for purposes of both estimation and discrimination. We present widely applicable approximations to the bias and to the variance of the Bayesian mean, obtained under the minimal and biologically relevant assumption that the encoding results from a series of independent, though not necessarily identically distributed, signals. Simulations substantiate the accuracy of our approximations in the small-noise regime. The bias of the Bayesian mean comprises two components: one driven by the prior, and one driven by the precision of the encoding.
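
For reference, the posterior-mean (Bayesian-mean) decoder referred to above can be stated as follows; the notation is ours, not the paper's. Given internal signals r encoding a stimulus x with prior \pi(x) and likelihood p(r \mid x),

    \hat{x}_{\mathrm{BM}}(r) = \mathbb{E}[x \mid r]
        = \frac{\int x \, p(r \mid x)\, \pi(x)\, \mathrm{d}x}{\int p(r \mid x)\, \pi(x)\, \mathrm{d}x},
    \qquad
    \mathrm{bias}(x) = \mathbb{E}\big[\hat{x}_{\mathrm{BM}}(r) \mid x\big] - x .
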



Sparse and Continuous Attention Mechanisms (André F. T. Martins, Marcos Treviso, Vlad Niculae, et al.)

Neural Information Processing Systems

Exponential families are widely used in machine learning; they include many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, there has been recent work on sparse alternatives to softmax (e.g., sparsemax).
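
As background on the finite-domain sparse alternatives mentioned above, here is a small NumPy implementation of sparsemax (Martins & Astudillo, 2016), the Euclidean projection of the scores onto the probability simplex; this is an illustration of the prior technique, not code from the paper.

    import numpy as np

    def sparsemax(z):
        """Sparsemax of a 1-D score vector: Euclidean projection onto the simplex.
        Unlike softmax, it can assign exactly zero probability to low scores."""
        z = np.asarray(z, dtype=float)
        z_sorted = np.sort(z)[::-1]                       # scores in descending order
        k = np.arange(1, len(z) + 1)
        cumsum = np.cumsum(z_sorted)
        support = 1.0 + k * z_sorted > cumsum             # coordinates that survive
        k_z = k[support][-1]                              # size of the support
        tau = (cumsum[support][-1] - 1.0) / k_z           # threshold
        return np.maximum(z - tau, 0.0)

    # Example: sparsemax zeroes out the weakest score, softmax would not.
    print(sparsemax([2.0, 1.5, -1.0]))   # [0.75, 0.25, 0.  ]
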


f0b76267fbe12b936bd65e203dc675c1-AuthorFeedback.pdf

Neural Information Processing Systems

Note that the VQA results in Table 2 with continuous attention use fewer basis functions than discrete regions. Good idea, we will add this to the camera-ready version. "Is this a necessary or a sufficient condition?" Sufficient; we will clarify and follow the suggestions (move the beta-escort definition to the main text and fix typos). We will add a citation. We chose ridge regression because it yields a closed-form solution expressed linearly in terms of the basis functions (Eq. We have not tried linear interpolation; however, for a high-level vision system, combining our method with BUTD is an interesting idea. Text is naturally composed of discrete tokens.
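
The closed-form solution alluded to above is the standard ridge-regression estimator, which is linear in the basis functions; the notation here is ours rather than the authors'. With basis evaluations \Phi \in \mathbb{R}^{n \times d}, targets y \in \mathbb{R}^{n}, and regularization strength \lambda > 0,

    \hat{\beta} = (\Phi^{\top}\Phi + \lambda I)^{-1} \Phi^{\top} y,
    \qquad
    \hat{f}(t) = \sum_{j=1}^{d} \hat{\beta}_{j}\, \phi_{j}(t).
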



Supplement: Matrix Completion with Quantified Uncertainty through Low Rank Gaussian Copula

Neural Information Processing Systems

For the first equality, we use Eq. In practice, the result is most useful for small d, such as d = 0. Let us first state a generalization of our Theorem 2. Theorem 4. Suppose x ~ LRGC(W, \sigma^2). The proof applies to each missing dimension j \in M. Let us further define s. For a detailed treatment of sub-Gaussian random distributions, see [10]. A random variable x is sub-Gaussian if (\mathbb{E}|x|^p)^{1/p} \le K\sqrt{p} for all p \ge 1 with some K > 0; the sub-Gaussian norm of x is defined as ||x||_{\psi_2} = \sup_{p \ge 1} p^{-1/2} (\mathbb{E}|x|^p)^{1/p}. Our Lemma 2 is Lemma 17 in [1], which is also a simplified version of Theorem 1 in [4]. To compute (2) and (3), we use the law of total expectation, similarly to Section 1.1, by first treating z. The computation is similar in all cases; we take the first case as an example.
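
For completeness, the law of total expectation invoked above, written in generic notation (ours, not the supplement's): for any integrable function g of x and any latent variable z,

    \mathbb{E}[\, g(x) \,] = \mathbb{E}_{z}\!\big[\, \mathbb{E}[\, g(x) \mid z \,] \,\big].
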