AITopics

A hallmark property of explainable AI models is the ability to teach other agents, communicating knowledge of how to perform a task. While Large Language Models (LLMs) perform complex reasoning by generating explanations for their predictions, it is unclear whether they also make good teachers for weaker agents. To address this, we consider a student-teacher framework between two LLM agents and study if, when, and how the teacher should intervene with natural language explanations to improve the student's performance. Since communication is expensive, we define a budget such that the teacher only communicates explanations for a fraction of the data, after which the student should perform well on its own. We decompose the teaching problem along four axes: (1) if teacher's test time intervention improve student predictions, (2) when it is worth explaining a data point, (3) how the teacher should personalize explanations to better teach the student, and (4) if teacher explanations also improve student performance on future unexplained data.

explanation, large language model, natural language, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Georgia > Dougherty County > Albany (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Education (1.00)
Government > Regional Government > North America Government > United States Government (0.67)
Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)

Add feedback

A The Algorithm of CPTPP embeddings to U

Neural Information Processing SystemsMay-25-2025, 11:21:55 GMT

We list detailed hyper-parameter settings here for reproducibility.

artificial intelligence, cptpp, machine learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

c6af791af7ef0f3e02bccef011211ca5-Paper-Conference.pdf

Neural Information Processing SystemsMay-25-2025, 11:21:52 GMT

artificial intelligence, machine learning, recommendation, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Virginia (0.14)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Certified Minimax Unlearning with Generalization Rates and Deletion Capacity

Neural Information Processing SystemsMay-25-2025, 11:19:31 GMT

We study the problem of (ϵ, δ)-certified machine unlearning for minimax models. Most of the existing works focus on unlearning from standard statistical learning models that have a single variable and their unlearning steps hinge on the direct Hessian-based conventional Newton update. We develop a new (ϵ, δ)-certified machine unlearning algorithm for minimax models. It proposes a minimax unlearning step consisting of a total Hessian-based complete Newton update and the Gaussian mechanism borrowed from differential privacy. To obtain the unlearning certification, our method injects calibrated Gaussian noises by carefully analyzing the "sensitivity" of the minimax unlearning step (i.e., the closeness between the minimax unlearning variables and the retraining-from-scratch variables).

artificial intelligence, machine learning, minimax, (15 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Industry:

Information Technology > Security & Privacy (1.00)
Law (0.93)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

c67b138497305835e76fdedd48dd4e59-Paper-Conference.pdf

Neural Information Processing SystemsMay-25-2025, 11:19:18 GMT

artificial intelligence, excess risk, machine learning, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

Add feedback

Data-Dependent Bounds for Online Portfolio Selection Without Lipschitzness and Smoothness Chung-En Tsai Department of Computer Science and Information Engineering National Taiwan University

Neural Information Processing SystemsMay-25-2025, 11:19:04 GMT

This work introduces the first small-loss and gradual-variation regret bounds for online portfolio selection, marking the first instances of data-dependent bounds for online convex optimization with non-Lipschitz, non-smooth losses. The algorithms we propose exhibit sublinear regret rates in the worst cases and achieve logarithmic regrets when the data is "easy," with per-round time almost linear in the number of investment alternatives. The regret bounds are derived using novel smoothness characterizations of the logarithmic loss, a local norm-based analysis of following the regularized leader (FTRL) with self-concordant regularizers, which are not necessarily barriers, and an implicit variant of optimistic FTRL with the log-barrier.

artificial intelligence, lemma 4, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > Taiwan (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Communications > Web > Semantic Web (0.40)

Add feedback

Sensitivity in Translation Averaging

Neural Information Processing SystemsMay-25-2025, 11:18:50 GMT

In 3D computer vision, translation averaging solves for absolute translations given a set of pairwise relative translation directions. While there has been much work on robustness to outliers and studies on the uniqueness of the solution, this paper deals with a distinctly different problem of sensitivity in translation averaging under uncertainty. We first analyze sensitivity in estimating scales corresponding to relative directions under small perturbations of the relative directions. Then, we formally define the conditioning of the translation averaging problem, which assesses the reliability of estimated translations based solely on the input directions. We give a sufficient criterion to ensure that the problem is well-conditioned. Subsequently, we provide an efficient algorithm to identify and remove combinations of directions which make the problem ill-conditioned while ensuring uniqueness of the solution. We demonstrate the utility of such analysis in global structure-frommotion pipelines for obtaining 3D reconstructions, which reveals the benefits of filtering the ill-conditioned set of directions in translation averaging in terms of reduced translation errors, a higher number of 3D points triangulated and faster convergence of bundle adjustment.

artificial intelligence, triangle, triplet, (18 more...)

Neural Information Processing Systems

Country: