AITopics | Statistical Learning

In particular, we consider group-structured and bounded f-divergence uncertainty sets. Our approach relies on an accelerated method that queries a ball optimization oracle, i.e., a subroutine that minimizes the objective within a small ball around the query point. Our main contribution is efficient implementations of this oracle for DRO objectives.
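To make the notion of a ball optimization oracle concrete, here is a minimal Python sketch. It is not the paper's accelerated method and not its DRO-specific oracle implementation: the oracle below is approximated with plain projected gradient descent, the outer loop simply re-centers the ball at each oracle output, and all names (`project_to_ball`, `ball_oracle`, `minimize_with_ball_oracle`) and parameter values are hypothetical choices for illustration.

```python
import math


def project_to_ball(x, center, radius):
    """Project x onto the Euclidean ball of the given radius around center."""
    diff = [xi - ci for xi, ci in zip(x, center)]
    norm = math.sqrt(sum(d * d for d in diff))
    if norm <= radius:
        return list(x)
    scale = radius / norm
    return [ci + scale * d for ci, d in zip(center, diff)]


def ball_oracle(grad, center, radius, lr=0.1, iters=200):
    """Approximately minimize the objective within a ball around `center`.

    Assumption: projected gradient descent stands in for the oracle;
    `grad` returns the gradient of a smooth convex objective.
    """
    x = list(center)
    for _ in range(iters):
        g = grad(x)
        step = [xi - lr * gi for xi, gi in zip(x, g)]
        x = project_to_ball(step, center, radius)
    return x


def minimize_with_ball_oracle(grad, x0, radius, outer_steps=50):
    """Unaccelerated outer loop: query the oracle, move to its answer, repeat."""
    x = list(x0)
    for _ in range(outer_steps):
        x = ball_oracle(grad, x, radius)
    return x


# Toy objective f(x) = 0.5 * ||x - c||^2, whose gradient is x - c.
target = [3.0, -4.0]


def toy_grad(x):
    return [xi - ci for xi, ci in zip(x, target)]


one_query = ball_oracle(toy_grad, [0.0, 0.0], radius=0.5)
solution = minimize_with_ball_oracle(toy_grad, [0.0, 0.0], radius=0.5)
```

Each oracle call can move the iterate by at most the ball radius, so the number of queries needed grows as the radius shrinks; an accelerated outer method of the kind the abstract refers to aims to reduce that query count.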

artificial intelligence, complexity, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)


e90b00adc3ba130eb2510d93ba3ff250-Paper-Conference.pdf

Neural Information Processing Systems | Feb-12-2026, 14:31:57 GMT

complexity, estimator, optimization, (13 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)


Causal Inference and Mechanism Clustering of A Mixture of Additive Noise Models

Shoubo Hu, Zhitang Chen, Vahid Partovi Nia, Laiwan CHAN, Yanhui Geng

Neural Information Processing Systems | Feb-12-2026, 14:31:20 GMT

Obviously in ANM-MM, all observations are generated by a set of g.m.s, which share the same function form (f) but differ in parameter values (θ).

anm-mm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)


Wasserstein Distributionally Robust Optimization Through the Lens of Structural Causal Models and Individual Fairness

Neural Information Processing Systems | Feb-12-2026, 14:22:24 GMT

To address this gap, we first formulate the DRO problem from causality and individual fairness perspectives.

artificial intelligence, intervention, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
North America > Canada > Quebec > Montreal (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.92)


ATOMO: Communication-efficient Learning via Atomic Sparsification

Hongyi Wang, Scott Sievert, Shengchao Liu, Zachary Charles, Dimitris Papailiopoulos, Stephen Wright

Neural Information Processing Systems | Feb-12-2026, 14:22:13 GMT

Neural Information Processing Systems http://nips.cc/

arxiv preprint arxiv, decomposition, gradient, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)


The promises and pitfalls of Stochastic Gradient Langevin Dynamics

Nicolas Brosse, Alain Durmus, Eric Moulines

Neural Information Processing Systems | Feb-12-2026, 14:17:26 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, sgld, wasserstein distance, (10 more...)

Neural Information Processing Systems

Country:

Europe > France (0.04)
Asia > China (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.75)


Gradient Sparsification for Communication-Efficient Distributed Optimization

Jianqiao Wangni, Jialei Wang, Ji Liu, Tong Zhang

Neural Information Processing Systems | Feb-12-2026, 14:16:14 GMT

In the synchronous stochastic gradient method, each worker processes a random minibatch of its training data, and then the local updates are synchronized by making an All-Reduce step, which aggregates stochastic gradients from all workers, and taking a Broadcast step that transmits the updated parameter vector back to all workers.
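The synchronization pattern described above can be sketched as a single-process Python simulation. This is an illustrative sketch under stated assumptions, not the paper's implementation and not its sparsification scheme: workers are simulated as list entries, the All-Reduce step is modeled as an element-wise average, the loss is a toy least-squares objective, and every name and hyperparameter (`make_shards`, `all_reduce_mean`, `synchronous_sgd`, `lr`, `batch_size`) is hypothetical.

```python
import random


def make_shards(num_workers, points_per_worker, true_w, seed=0):
    """Build one noiseless linear-regression data shard per simulated worker."""
    rng = random.Random(seed)
    shards = []
    for _ in range(num_workers):
        shard = []
        for _ in range(points_per_worker):
            x = [rng.uniform(-1.0, 1.0) for _ in true_w]
            y = sum(wi * xi for wi, xi in zip(true_w, x))
            shard.append((x, y))
        shards.append(shard)
    return shards


def minibatch_gradient(w, batch):
    """Gradient of the mean of 0.5 * (w.x - y)^2 over one minibatch."""
    grad = [0.0] * len(w)
    for x, y in batch:
        err = sum(wi * xi for wi, xi in zip(w, x)) - y
        for j, xj in enumerate(x):
            grad[j] += err * xj / len(batch)
    return grad


def all_reduce_mean(vectors):
    """Stand-in for All-Reduce: average the workers' gradients element-wise."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def synchronous_sgd(shards, dim, steps=300, batch_size=4, lr=0.5, seed=0):
    rng = random.Random(seed)
    w = [0.0] * dim  # parameter vector shared by all workers
    for _ in range(steps):
        # Each worker draws a random minibatch from its own shard.
        local_grads = [
            minibatch_gradient(w, rng.sample(shard, batch_size))
            for shard in shards
        ]
        # All-Reduce: aggregate the stochastic gradients from all workers.
        g = all_reduce_mean(local_grads)
        # Update, then "Broadcast": every worker reads the new w next round.
        w = [wi - lr * gi for wi, gi in zip(w, g)]
    return w


shards = make_shards(num_workers=4, points_per_worker=16, true_w=[2.0, -1.0], seed=1)
w_final = synchronous_sgd(shards, dim=2)
```

In a real cluster the gradient vectors exchanged in the All-Reduce step dominate communication cost, which is the quantity that gradient sparsification aims to reduce.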

artificial intelligence, gradient descent, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.58)


Statistical Learning

Single-Model Uncertainties for Deep Learning

60bc87f3cf5257579435d92ec12c761b-Supplemental-Datasets_and_Benchmarks.pdf

Greedy Sampling for Approximate Clustering in the Presence of Outliers

Distributionally Robust Optimization via Ball Oracle Acceleration
