AITopics | high confidence

Beyond Prediction: Managing the Repercussions of Machine Learning Applications

Neural Information Processing SystemsJun-14-2026, 19:03:07 GMT

Machine learning models are often designed to maximize a primary goal, such as accuracy. However, as these models are increasingly used to inform decisions that affect people's lives or well-being, it is often unclear what the real-world repercussions of their deployment might be--making it crucial to understand and manage such repercussions effectively. Models maximizing user engagement on social media platforms, e.g., may inadvertently contribute to the spread of misinformation and content that deepens political polarization. This issue is not limited to social media--it extends to other applications where machine learning-informed decisions can have real-world repercussions, such as education, employment, and lending. Existing methods addressing this issue require prior knowledge or estimates of analytical models describing the relationship between a classifier's predictions and their corresponding repercussions. We introduce THEIA, a novel classification algorithm capable of optimizing a primary objective, such as accuracy, while providing high-confidence guarantees about its potential repercussions. Importantly, THEIA solves the open problem of providing such guarantees based solely on existing data with observations of previous repercussions. We prove that it satisfies constraints on a model's repercussions with high confidence and that it is guaranteed to identify a solution, if one exists, given sufficient data. We empirically demonstrate, using real-life data, that THEIA can identify models that achieve high accuracy while ensuring, with high confidence, that constraints on their repercussions are satisfied.

artificial intelligence, machine learning, social media, (19 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts (0.28)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry:

Banking & Finance (1.00)
Education (0.86)
Media > News (0.34)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.45)

Add feedback

05dc08730e32441edff52b0fa6caab5f-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 08:36:21 GMT

artificial intelligence, machine learning, segmentation, (16 more...)

Neural Information Processing Systems

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (0.74)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.76)
Information Technology > Sensing and Signal Processing > Image Processing (0.55)

Add feedback

029df12a9363313c3e41047844ecad94-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 05:58:38 GMT

There is a road and there are many atoms and trees beside it and there is a building in the right corner.

artificial intelligence, information management, natural language, (15 more...)

Neural Information Processing Systems

Country:

Europe (1.00)
North America > United States > California > San Francisco County > San Francisco (0.15)

Industry:

Health & Medicine (0.94)
Transportation > Ground > Rail (0.93)

Technology:

Information Technology > Information Management > Search (0.70)
Information Technology > Artificial Intelligence > Natural Language (0.69)
Information Technology > Sensing and Signal Processing > Image Processing (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.47)

Add feedback

Exploring Geometry of Blind Spots in Vision Models

Neural Information Processing SystemsFeb-15-2026, 20:43:26 GMT

We propose a Level Set Traversal algorithm that iteratively explores regions of high confidence with respect to the input space using orthogonal components of the local gradients.

artificial intelligence, machine learning, source image, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Prince George's County > College Park (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Supplementary 1 A Details of sampling process

Neural Information Processing SystemsFeb-15-2026, 15:12:43 GMT

Most of the breeds will have the first type of collapse as in Figure 5, 6, and 7. Figure 5, 6 shows the

artificial intelligence, guidance, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Europe (0.04)
Asia > Middle East > Israel (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (0.69)
Information Technology > Sensing and Signal Processing > Image Processing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

f4d4a021f9051a6c18183b059117e8b5-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-12-2026, 21:08:33 GMT

annotation, detection, professional labeler, (15 more...)

Neural Information Processing Systems

Country:

Atlantic Ocean > South Atlantic Ocean > Gulf of Guinea (0.08)
Africa > Gulf of Guinea (0.08)
Europe > Norway (0.07)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.97)

Add feedback

05dc08730e32441edff52b0fa6caab5f-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 10:42:55 GMT

dataset, medical image segmentation, segmentation, (14 more...)

Neural Information Processing Systems

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (0.74)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.76)
Information Technology > Sensing and Signal Processing > Image Processing (0.55)

Add feedback

029df12a9363313c3e41047844ecad94-Supplemental-Conference.pdf

Neural Information Processing SystemsDec-27-2025, 17:46:46 GMT

caption, knowledge, query, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.15)
Europe > Austria > Vienna (0.14)
Europe > Sweden > Stockholm > Stockholm (0.06)
(23 more...)

Genre: Workflow (0.65)

Industry:

Health & Medicine (0.94)
Transportation > Ground > Rail (0.93)

Technology:

Information Technology > Information Management > Search (0.69)
Information Technology > Artificial Intelligence > Natural Language (0.69)
Information Technology > Sensing and Signal Processing > Image Processing (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.47)

Add feedback

Prompt4Trust: A Reinforcement Learning Prompt Augmentation Framework for Clinically-Aligned Confidence Calibration in Multimodal Large Language Models

Kriz, Anita, Janes, Elizabeth Laura, Shen, Xing, Arbel, Tal

arXiv.org Artificial IntelligenceOct-14-2025

Multimodal large language models (MLLMs) hold considerable promise for applications in healthcare. However, their deployment in safety-critical settings is hindered by two key limitations: (i) sensitivity to prompt design, and (ii) a tendency to generate incorrect responses with high confidence. As clinicians may rely on a model's stated confidence to gauge the reliability of its predictions, it is especially important that when a model expresses high confidence, it is also highly accurate. We introduce Prompt4Trust, the first reinforcement learning (RL) framework for prompt augmentation targeting confidence calibration in MLLMs. A lightweight LLM is trained to produce context-aware auxiliary prompts that guide a downstream task MLLM to generate responses in which the expressed confidence more accurately reflects predictive accuracy. Unlike conventional calibration techniques, Prompt4Trust specifically prioritizes aspects of calibration most critical for safe and trustworthy clinical decision-making. Beyond improvements driven by this clinically motivated calibration objective, our proposed method also improves task accuracy, achieving state-of-the-art medical visual question answering (VQA) performance on the PMC-VQA benchmark, which is composed of multiple-choice questions spanning diverse medical imaging modalities. Moreover, our framework trained with a small downstream task MLLM showed promising zero-shot generalization to larger MLLMs in our experiments, suggesting the potential for scalable calibration without the associated computational costs. This work demonstrates the potential of automated yet human-aligned prompt engineering for improving the the trustworthiness of MLLMs in safety critical settings. Our codebase can be found at https://github.com/xingbpshen/prompt4trust.

artificial intelligence, large language model, natural language, (16 more...)

arXiv.org Artificial Intelligence

2507.09279

Country: