AITopics | ytest

Collaborating Authors

ytest

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Conformal Language Modeling via Posterior Sampling

Emmenegger, Nicolas, Olausson, Theo X., Solar-Lezama, Armando, Podimata, Chara

arXiv.org Machine LearningJun-3-2026

Large Language Models remain plagued by hallucinations. Recent work has sought to tame their prevalence using statistical techniques based on conformal prediction, with both theoretical and empirical success. However, these methods operate in a post-hoc fashion, treating the sampling procedure itself as atomic and then surgically altering samples to remove hallucinated claims. This disconnect between filtering and generation can result in samples that are incoherent, inconsistent, or simply unlikely under the model itself. Moreover, post-hoc surgery is unable to shift probability mass towards more useful and helpful responses. To address these issues, we propose to instead sample from approximations to an LLM posterior, where the conditioning event corresponds to a calibrated, high-scoring region. We develop a calibration procedure tailored to the setting of conditional sequential generation that effectively identifies this region and achieves target risk control. Empirically, we apply our method to case studies focused on open-ended biography generation and mathematical problem solving; compared to prior work, we obtain the same statistical guarantees, with higher downstream utility.

artificial intelligence, large language model, natural language, (21 more...)

arXiv.org Machine Learning

2606.03731

Country: North America > United States (0.28)

Genre:

Questionnaire & Opinion Survey (0.93)
Research Report (0.64)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.83)

Add feedback

Towards understanding retrosynthesis by energy-based models

Neural Information Processing SystemsApr-25-2026, 23:04:32 GMT

Retrosynthesis is the process of identifying a set of reactants to synthesize a target molecule. It is critical to material design and drug discovery. Existing machine learning approaches based on language models and graph neural networks have achie rarely ved discussed, encouraging and rigorous results. Ho evaluations wever, the of inner these connections models are of lar these gely in models need.

machine learning, natural language, template, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

Beyond Fixed False Discovery Rates: Post-Hoc Conformal Selection with E-Variables

Zhu, Meiyi, Simeone, Osvaldo

arXiv.org Machine LearningApr-20-2026

Conformal selection (CS) uses calibration data to identify test inputs whose unobserved outcomes are likely to satisfy a pre-specified minimal quality requirement, while controlling the false discovery rate (FDR). Existing methods fix the target FDR level before observing data, which prevents the user from adapting the balance between number of selected test inputs and FDR to downstream needs and constraints based on the available data. For example, in genomics or neuroimaging, researchers often inspect the distribution of test statistics, and decide how aggressively to pursue candidates based on observed evidence strength and available follow-up resources. To address this limitation, we introduce {post-hoc CS} (PH-CS), which generates a path of candidate selection sets, each paired with a data-driven false discovery proportion (FDP) estimate. PH-CS lets the user select any operating point on this path by maximizing a user-specified utility, arbitrarily balancing selection size and FDR. Building on conformal e-variables and the e-Benjamini-Hochberg (e-BH) procedure, PH-CS is proved to provide a finite-sample post-hoc reliability guarantee whereby the ratio between estimated FDP level and true FDP is, on average, upper bounded by $1$, so that the average estimated FDP is, to first order, a valid upper bound on the true FDR. PH-CS is extended to control quality defined in terms of a general risk. Experiments on synthetic and real-world datasets demonstrate that, unlike CS, PH-CS can consistently satisfy user-imposed utility constraints while producing reliable FDP estimates and maintaining competitive FDR control.

artificial intelligence, machine learning, selection, (16 more...)

arXiv.org Machine Learning

2604.11305

Country:

Asia > Middle East > Jordan (0.04)
North America > United States (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.48)
Health & Medicine > Diagnostic Medicine (0.48)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

f50f282a3093d36471008b045bd478af-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 21:13:05 GMT

In this way, even with limited meta samples, MTG-Net holds the potential to produce reasonable gain estimations on arbitrary task combinations.

artificial intelligence, machine learning, task combination, (18 more...)

Neural Information Processing Systems

Genre: Research Report (0.47)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)

Add feedback

ProvablyRobustMetricLearning

Neural Information Processing SystemsFeb-10-2026, 19:19:36 GMT

Experimental results showthattheproposed metriclearning algorithm improves both certified robust errors and empirical robust errors (errors under adversarial attacks).

artificial intelligence, machine learning, perturbation, (19 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Russia (0.04)
(2 more...)

Industry:

Government (0.36)
Information Technology (0.36)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

e038453073d221a4f32d0bab94ca7cee-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-10-2026, 19:19:25 GMT

We fully understand the concern about our baselines since2 we are the first to improve certified robustness of metric learning. Therefore, as Reviewer 4 suggested, we add3 experiments comparing with neural networks certification methods, including ordinary neural networks certified by4 CROWN [48] and randomized-smoothing neural networks [11]. The results are shown in Figure i. In general, computational11 cost is not an issue for ARML. To make the comparison fair, all of15 the methods are run on CPU (Xeon(R) E5-2620 v4 @16 2.10GHz).

artificial intelligence, machine learning, metric learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Add feedback

M4I: Multi-modalModels Membership Inference

Neural Information Processing SystemsFeb-7-2026, 10:35:17 GMT

ROUGE-N scores are the overlapping of n-grams [2] between the generated and referencesequence. Those scores are then averaged overthe whole corpus toreach anoverall quality. For both proposed MMMMI attack methods, shadow models are indispensable. The first hidden layer in the attack model has 256 units and the second hidden layer has20units, bothactivatedbyReLU function. We used resnet-LSTM architecture as the target model architecture.

artificial intelligence, dataset, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.09)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.99)

Add feedback