No-regret Algorithms for Fair Resource Allocation

Neural Information Processing Systems

Suppose a revenue-maximizing recommendation algorithm concludes from past data that showing an ad to Group A generates more revenue than showing it to Group B. In that case, the ad-serving algorithm will eventually end up showing that ad exclusively to Group A.
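
As a rough, hypothetical sketch of this feedback loop (not from the paper), the snippet below simulates a greedy ad-server that always serves the group with the higher empirical revenue estimate; the group names, revenue rates, and sample counts are assumptions for illustration.

```python
import random

# Hypothetical sketch of the feedback loop described above (not from the
# paper): a greedy ad-server estimates per-group revenue from past data and
# always serves whichever group currently looks more lucrative.
random.seed(0)
true_rate = {"A": 0.55, "B": 0.50}   # assumed true per-impression revenue rates
clicks = {"A": 60, "B": 50}          # hypothetical "past data": successes...
shown  = {"A": 100, "B": 100}        # ...out of impressions served per group

for _ in range(10_000):
    group = max(shown, key=lambda g: clicks[g] / shown[g])  # greedy choice
    shown[group] += 1
    clicks[group] += random.random() < true_rate[group]
    # Only the served group's estimate is updated, so once Group B falls
    # behind it is never shown again and its estimate can never recover.

print({g: shown[g] - 100 for g in shown})  # new impressions: nearly all to "A"
```

Because the losing group stops receiving impressions, its estimate is frozen; the allocation is self-reinforcing even when the true revenue gap is small.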



Supplementary Material

Neural Information Processing Systems

Let π0(·|s) be a Gaussian behavioral reference policy with mean µ0(s) and variance σ0²(s), and let π(·|s) be an online policy with reparameterization a_t = f_φ(ε_t; s_t) and random noise vector ε_t. Whilst entropy regularization partially mitigates the collapse of predictive variance away from the expert demonstrations, we still observe the wrong trend, similar to Figure 1, with predictive variances high near the expert demonstrations and low on unseen data. AWAC performs online fine-tuning of a policy pre-trained on offline data. The method requires additional off-policy data to be generated to saturate the replay buffer, thereby requiring a hidden number of environment interactions that do not involve learning. To mitigate this, in practice, BRAC adds an entropy bonus to the supervised learning objective, which stabilizes the variance around the training set but has no guarantees away from the data.
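
To make the objective concrete, here is a minimal sketch (our own, not the authors' code) of a reparameterized Gaussian online policy trained with a supervised term plus an entropy bonus, in the style attributed to BRAC above; the network size, the coefficient `alpha`, and the tensor shapes are assumptions.

```python
import torch

# Sketch under stated assumptions: state_dim, action_dim, and alpha are
# illustrative values, not from the paper.
state_dim, action_dim, alpha = 8, 2, 0.01

net = torch.nn.Linear(state_dim, 2 * action_dim)  # outputs [mu, log_sigma]

def bc_entropy_loss(states, expert_actions):
    mu, log_sigma = net(states).chunk(2, dim=-1)
    dist = torch.distributions.Normal(mu, log_sigma.exp())
    a = dist.rsample()  # reparameterized draw: a_t = mu + sigma * eps_t
    bc = ((a - expert_actions) ** 2).mean()  # supervised (behavior-cloning) term
    # Entropy bonus: discourages the predictive variance from collapsing on
    # the training set but, as noted above, offers no guarantee off-data.
    return bc - alpha * dist.entropy().mean()

# Example call on random data with the assumed shapes:
loss = bc_entropy_loss(torch.randn(32, state_dim), torch.randn(32, action_dim))
loss.backward()
```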






Supplementary Material Table of Contents

Neural Information Processing Systems

A Laplace behavioral reference policy may be able to mitigate some of the problems posed by Proposition 1 due to the heavy tails of the distribution. Tikhonov regularization does not resolve the issue with calibration of uncertainties. AWAC performs online fine-tuning of a policy pre-trained on offline data. BRAC regularizes the online policy against an offline behavioral policy, as our method does. DAPG incorporates offline data into policy gradients by pre-training with a behaviorally cloned policy and then augmenting the RL loss with a supervised-learning loss.
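
The heavy-tail intuition can be seen numerically; the sketch below (our own illustration with unit scale parameters assumed, not the paper's analysis) compares Gaussian and Laplace log-densities as an action moves away from the reference mean: the Gaussian penalty grows quadratically while the Laplace penalty grows only linearly, so a Laplace reference down-weights far-from-data actions far less severely.

```python
import math

def gaussian_logpdf(a, mu=0.0, sigma=1.0):
    # log N(a; mu, sigma^2)
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (a - mu) ** 2 / (2 * sigma ** 2)

def laplace_logpdf(a, mu=0.0, b=1.0):
    # log Laplace(a; mu, b)
    return -math.log(2 * b) - abs(a - mu) / b

for a in (0.0, 2.0, 5.0, 10.0):
    print(f"a={a:5.1f}  gaussian={gaussian_logpdf(a):8.2f}  laplace={laplace_logpdf(a):8.2f}")
# At a=10 the Gaussian log-density is about -50.9 versus about -10.7 for the
# Laplace: the quadratic tail dominates, which is the heavy-tail argument above.
```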