AITopics | Overview

The sample complexities of our algorithms can be quantified in terms of well-known quantities like the extended teaching dimension and haystack dimension.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)

Genre:

Research Report (0.68)
Overview (0.68)

Industry: Education > Educational Setting > Online (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

b30958093daeed059670b35173654dc9-Supplemental.pdf

Neural Information Processing SystemsAug-15-2025, 21:30:19 GMT

comparison system, convergence, q-learning, (13 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(2 more...)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms

Neural Information Processing SystemsAug-15-2025, 21:30:11 GMT

However, its application to Q-learning has been limited due to the presence of the max-operator, which makes the associated ODE model a complex nonlinear system. In contrast, the associated ODE of TD learning for policy evaluation is a linear system, whose asymptotic stability is much easier to analyze in general.

algorithm, convergence, q-learning, (10 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
North America > Canada (0.04)
(2 more...)

Genre: Overview (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

af5d5ef24881f3c3049a7b9bfe74d58b-Paper.pdf

Neural Information Processing SystemsAug-15-2025, 19:59:29 GMT

algorithm, constraint, international conference, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Shanghai > Shanghai (0.04)
(5 more...)

Genre:

Research Report (0.69)
Overview (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

Policy Optimization with Linear Temporal Logic Constraints Cameron V oloshin Caltech Hoang M. Le Argo AI Swarat Chaudhuri UT Austin Yisong Yue Argo AI Caltech

Neural Information Processing SystemsAug-15-2025, 18:41:31 GMT

However, capturing real-world task specifications using scalar costs can be challenging. For one, real-world tasks often consist of objectives that are required, as well as those that are merely desirable.

cost function, reinforcement learning, specification, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia > Arlington County > Arlington (0.04)
North America > United States > Michigan (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(3 more...)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Robots (0.68)

Add feedback

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

Neural Information Processing SystemsAug-15-2025, 18:12:12 GMT

Noun prototypes are generated in an unsupervised manner and contextual pronoun features are trained to select prototypes. As such, the network remains noun-agnostic during inference.

artificial intelligence, machine learning, natural language, (14 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre:

Research Report (0.68)
Overview (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
(4 more...)

Add feedback

Learning the Linear Quadratic Regulator from Nonlinear Observations

Neural Information Processing SystemsAug-15-2025, 15:32:39 GMT

The learner's goal is to P AC-learn an

assumption, decoder, probability, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)

Genre:

Workflow (0.46)
Overview (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.92)

Add feedback

On Inductive Biases for Heterogeneous Treatment Effect Estimation Appendix

Neural Information Processing SystemsAug-15-2025, 15:17:00 GMT

Here, we present a detailed overview of existing model-agnostic "meta-learner" strategies for CA TE Unfortunately, good performance on estimation of the POs is not sufficient. Note that, as we discuss in section C.2, we fixed all hyperparameters throughout all experiments as tuning Input: Testing data X Trained FlexTENet flex for i 1: flex.n_layers We retrieve the data from https://jenniferhill7.wixsite.com/acic-2016/competition "D" we change only the response surface of the treated to As stated in the main text, we fixed equivalent hyperparameters across all methods within any experiments to not conflate hyperparameter tuning with the value of the different strategies. B (D.3), present additional results on PO estimation (D.4), and then move to analyzing the learned We also consider the effect of using our approaches as first-stage (nuisance) estimators for two-step learners (D.6).

artificial intelligence, machine learning, setup, (18 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)

Genre:

Research Report > Experimental Study (1.00)
Overview (0.66)

Industry: Health & Medicine > Public Health (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Filters

Collaborating Authors

Overview

Teaching an Active Learner with Contrastive Examples Chaoqi Wang University of Chicago

94aada62f90dd50a84ca74304563d5db-Supplemental.pdf

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

b30958093daeed059670b35173654dc9-Supplemental.pdf

A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms

af5d5ef24881f3c3049a7b9bfe74d58b-Paper.pdf

Policy Optimization with Linear Temporal Logic Constraints Cameron V oloshin Caltech Hoang M. Le Argo AI Swarat Chaudhuri UT Austin Yisong Yue Argo AI Caltech

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

Learning the Linear Quadratic Regulator from Nonlinear Observations

On Inductive Biases for Heterogeneous Treatment Effect Estimation Appendix