AITopics | hopper

Collaborating Authors

hopper

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Hyperparameter Settings of RD

Neural Information Processing SystemsFeb-15-2026, 13:03:00 GMT

In this section, we describe details about hyperparameter setting of RD. SAC-N-Unc and TD3-N-Unc, M is set to 1/10 of the total training steps. To ensure fairness, algorithms employing RD are implemented using CORL repository [54]. By modifying the original SAC/TD3 algorithm to employ a critic ensemble of number N and incorporate an uncertainty regularization term within the policy update process, we derive these backbone algorithms. Additionally, using RD with fewer Q ensembles can achieve similar or even better results than the backbone methods using more Q ensembles, indicating its potential in reducing computing resource consumption.

algorithm, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Add feedback

SAD-Flower: Flow Matching for Safe, Admissible, and Dynamically Consistent Planning

Huang, Tzu-Yuan, Lederer, Armin, Wu, Dai-Jie, Dai, Xiaobing, Zhang, Sihua, Sosnowski, Stefan, Sun, Shao-Hua, Hirche, Sandra

arXiv.org Artificial IntelligenceDec-2-2025

Flow matching (FM) has shown promising results in data-driven planning. However, it inherently lacks formal guarantees for ensuring state and action constraints, whose satisfaction is a fundamental and crucial requirement for the safety and admissibility of planned trajectories on various systems. Moreover, existing FM planners do not ensure the dynamical consistency, which potentially renders trajectories inexecutable. We address these shortcomings by proposing SAD-Flower, a novel framework for generating Safe, Admissible, and Dynamically consistent trajectories. Our approach relies on an augmentation of the flow with a virtual control input. Thereby, principled guidance can be derived using techniques from nonlinear control theory, providing formal guarantees for state constraints, action constraints, and dynamic consistency. Crucially, SAD-Flower operates without retraining, enabling test-time satisfaction of unseen constraints. Through extensive experiments across several tasks, we demonstrate that SAD-Flower outperforms various generative-model-based baselines in ensuring constraint satisfaction.

constraint, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2511.05355

Country:

Europe > United Kingdom > North Sea > Southern North Sea (0.04)
North America > United States > Utah (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.92)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Causal Confusion in Imitation Learning

Neural Information Processing SystemsNov-17-2025, 20:23:16 GMT

Behavioral cloning reduces policy learning to supervised learning by training a discriminative model to predict expert actions given observations.

artificial intelligence, causal misidentification, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Middlesex County > Piscataway (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.68)

Industry: Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

947018640bf36a2bb609d3557a285329-AuthorFeedback.pdf

Neural Information Processing SystemsNov-17-2025, 20:23:01 GMT

artificial intelligence, intervention, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Appendix Table of Contents

Neural Information Processing SystemsNov-16-2025, 01:48:43 GMT

The number of layers is 12 for GPT2 and randomly initialized model and 24 for iGPT. Note that these notations are sometimes used interchangeably as long as it doesn't significantly The activation to be analyzed are outputs from all layers . CKA about is shown in Figure 1. The design of the diagram is based on a previous study [35]. Figure 11: Activation we consider to compute CKA.

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country: North America > Mexico > Gulf of Mexico (0.14)

Genre: Collection (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.53)

Add feedback

f0eb6568ea114ba6e293f903c34d7488-AuthorFeedback.pdf

Neural Information Processing SystemsNov-15-2025, 15:14:21 GMT

adversary, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Supplementary Material for BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning A Proofs of Theorems

Neural Information Processing SystemsNov-15-2025, 09:54:00 GMT

BAIL includes a regularization scheme to prevent over-fitting when generating the upper envelope. We refer to it as an "early stopping scheme" because the key idea is to return to the parameter values which gave the lowest validation error (see Section 7.8 of Goodfellow et al. Details are provided in Table 1. Table 1: BAIL hyper-parameters Parameter V alue discount rate γ 0. 99 horizon T 1000 training set size 0. 8 |B| validation set size 0. 2 |B| optimizer Adam [4] percentage p % 30% for BAIL 25% for Progressive BAIL upper envelope network structure 128 128 hidden units, ReLU activation learning rate 3 10 We use five MuJoCo environments, including Humanoid, which is the most challenging of the MuJoCo environments, and is not attempted in most other papers on batch DRL. The BCQ paper [2] also uses the same hyper-parameters for all experiments.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Genre: Research Report > New Finding (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Supplementary Material A Details on experimental setups A.1 Environments

Neural Information Processing SystemsNov-14-2025, 15:04:54 GMT

One can observe that transition dynamics follow multi-modal distributions. We visualize the transitions in Figure 8a. The objective of Pendulum is to swing up the pole and keep the pole upright within 200 time steps. Hopper is to move forward as fast as possible while minimizing the action cost within 500 time steps. We visualize the transitions in Figure 8d.

artificial intelligence, machine learning, slimhumanoid, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Add feedback