AITopics | Data Mining

Robust Conformal Prediction Using Privileged Information

Neural Information Processing SystemsMar-27-2025, 09:59:39 GMT

We develop a method to generate prediction sets with a guaranteed coverage rate that is robust to corruptions in the training data, such as missing or noisy variables. Our approach builds on conformal prediction, a powerful framework to construct prediction sets that are valid under the i.i.d assumption. Importantly, naively applying conformal prediction does not provide reliable predictions in this setting, due to the distribution shift induced by the corruptions. To account for the distribution shift, we assume access to privileged information (PI). The PI is formulated as additional features that explain the distribution shift, however, they are only available during training and absent at test time. We approach this problem by introducing a novel generalization of weighted conformal prediction and support our method with theoretical coverage guarantees. Empirical experiments on both real and synthetic datasets indicate that our approach achieves a valid coverage rate and constructs more informative predictions compared to existing methods, which are not supported by theoretical guarantees.

data mining, machine learning, prediction, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England (0.14)
North America > United States > California (0.14)
Europe > Middle East > Cyprus (0.14)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.45)
Health & Medicine > Public Health (0.45)
Education > Educational Setting (0.45)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

86c17de05579cde52025f9984e6e2ebb-Paper-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 09:54:04 GMT

forecasting, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country: North America > United States (1.00)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

The Fairness-Quality Trade-off in Clustering

Neural Information Processing SystemsMar-27-2025, 09:52:11 GMT

Fairness in clustering has been considered extensively in the past; however, the trade-off between the two objectives -- e.g., can we sacrifice just a little in the quality of the clustering to significantly increase fairness, or vice-versa?

data mining, evolutionary algorithm, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe (0.14)
North America (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry:

Transportation (0.46)
Health & Medicine (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)
(2 more...)

Add feedback

Model Shapley: Equitable Model Valuation with Black-box Access, Thanh Lam

Neural Information Processing SystemsMar-27-2025, 09:52:04 GMT

Valuation methods of data and machine learning (ML) models are essential to the establishment of AI marketplaces.

data mining, dirichlet abstraction, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America (0.28)

Genre: Research Report (0.68)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (0.93)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(5 more...)

Add feedback

8671b6dffc08b4fcf5b8ce26799b2bef-Paper-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 09:40:20 GMT

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)

Add feedback

a09e0dd6f92e402256725e15d3331811-Supplemental-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 09:39:25 GMT

artificial intelligence, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Industry: Information Technology > Services (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Information Management (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

Product Ranking for Revenue Maximization with Multiple Purchases

Neural Information Processing SystemsMar-27-2025, 09:39:21 GMT

Product ranking is the core problem for revenue-maximizing online retailers. To design proper product ranking algorithms, various consumer choice models are proposed to characterize the consumers' behaviors when they are provided with a list of products. However, existing works assume that each consumer purchases at most one product or will keep viewing the product list after purchasing a product, which does not agree with the common practice in real scenarios. In this paper, we assume that each consumer can purchase multiple products at will. To model consumers' willingness to view and purchase, we set a random attention span and purchase budget, which determines the maximal amount of products that he/she views and purchases, respectively. Under this setting, we first design an optimal ranking policy when the online retailer can precisely model consumers' behaviors. Based on the policy, we further develop the Multiple-Purchase-with-Budget UCB (MPB-UCB) algorithms with Õ( T) regret that estimate consumers' behaviors and maximize revenue simultaneously in online settings. Experiments on both synthetic and semi-synthetic datasets prove the effectiveness of the proposed algorithms.

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Industry:

Retail (1.00)
Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Information Management (0.93)
Information Technology > Enterprise Applications (0.76)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

86419aba4e5eafd2b1009a2e3c540bb0-Paper-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 09:34:06 GMT

artificial intelligence, information management, machine learning, (18 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Genre: Research Report (0.67)

Industry:

Health & Medicine > Epidemiology (0.67)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)
Health & Medicine > Therapeutic Area > Immunology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Social Media (0.96)
(2 more...)

Add feedback

Reduction Algorithms for Persistence Diagrams of Networks: CoralTDA and PrunIT

Neural Information Processing SystemsMar-27-2025, 09:28:07 GMT

Topological data analysis (TDA) delivers invaluable and complementary information on the intrinsic properties of data inaccessible to conventional methods. However, high computational costs remain the primary roadblock hindering the successful application of TDA in real-world studies, particularly with machine learning on large complex networks. Indeed, most modern networks such as citation, blockchain, and online social networks often have hundreds of thousands of vertices, making the application of existing TDA methods infeasible. We develop two new, remarkably simple but effective algorithms to compute the exact persistence diagrams of large graphs to address this major TDA limitation.

data mining, machine learning, vertex, (17 more...)

Neural Information Processing Systems

Country:

Europe (1.00)
North America > Canada (0.68)
North America > United States > California (0.46)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.93)
Information Technology (0.66)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Social Media (0.89)

Add feedback

Strategic Linear Contextual Bandits

Neural Information Processing SystemsMar-27-2025, 09:27:11 GMT

Motivated by the phenomenon of strategic agents gaming a recommender system to maximize the number of times they are recommended to users, we study a strategic variant of the linear contextual bandit problem, where the arms can strategically misreport privately observed contexts to the learner. We treat the algorithm design problem as one of mechanism design under uncertainty and propose the Optimistic Grim Trigger Mechanism (OptGTM) that incentivizes the agents (i.e., arms) to report their contexts truthfully while simultaneously minimizing regret. We also show that failing to account for the strategic nature of the agents results in linear regret. However, a trade-off between mechanism design and regret minimization appears to be unavoidable. More broadly, this work aims to provide insight into the intersection of online learning and mechanism design.

artificial intelligence, big data, data mining, (15 more...)

Neural Information Processing Systems

Country: