AITopics | env

Collaborating Authors

env

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Multi-Agent Common Knowledge Reinforcement Learning

Christian Schroeder de Witt, Jakob Foerster, Gregory Farquhar, Philip Torr, Wendelin Boehmer, Shimon Whiteson

Neural Information Processing SystemsNov-19-2025, 14:52:05 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > Canada (0.04)
Europe > Denmark (0.04)
Africa > Sudan (0.04)

Industry:

Information Technology (1.00)
Leisure & Entertainment > Games > Computer Games (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.94)
(2 more...)

Add feedback

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Jing, Zhi, Yang, Siyuan, Ao, Jicong, Xiao, Ting, Jiang, Yu-Gang, Bai, Chenjia

arXiv.org Artificial IntelligenceNov-18-2025

For robotic manipulation, existing robotics datasets and simulation benchmarks predominantly cater to robot-arm platforms. However, for humanoid robots equipped with dual arms and dexterous hands, simulation tasks and high-quality demonstrations are notably lacking. Bimanual dexterous manipulation is inherently more complex, as it requires coordinated arm movements and hand operations, making autonomous data collection challenging. This paper presents HumanoidGen, an automated task creation and demonstration collection framework that leverages atomic dexterous operations and LLM reasoning to generate relational constraints. Specifically, we provide spatial annotations for both assets and dexterous hands based on the atomic operations, and perform an LLM planner to generate a chain of actionable spatial constraints for arm movements based on object affordances and scenes. To further improve planning ability, we employ a variant of Monte Carlo tree search to enhance LLM reasoning for long-horizon tasks and insufficient annotation. In experiments, we create a novel benchmark with augmented scenarios to evaluate the quality of the collected data. The results show that the performance of the 2D and 3D diffusion policies can scale with the generated dataset. Project page is https://openhumanoidgen.github.io.

constraint, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2507.00833

Country:

Asia > China > Shanghai > Shanghai (0.04)
South America > Peru > Madre de Dios Department (0.04)

Genre:

Workflow (0.93)
Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots > Manipulation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

d630537fc4402cfa3ebbc7450a0cac91-Paper-Conference.pdf

Neural Information Processing SystemsNov-16-2025, 10:02:22 GMT

artificial intelligence, convergence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.05)
Asia > China > Hong Kong (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.32)

Add feedback

A Other Methods

Neural Information Processing SystemsNov-14-2025, 18:48:39 GMT

We hypothesize that GDRO's incentive to perform well

ar focal reweight us reweight, artificial intelligence, machine learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

Add feedback

9161ab7a1b61012c4c303f10b4c16b2c-AuthorFeedback.pdf

Neural Information Processing SystemsNov-14-2025, 12:38:11 GMT

artificial intelligence, asaf-1, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

Convergence of Actor-Critic Methods with Multi-Layer Neural Networks

Neural Information Processing SystemsNov-14-2025, 03:23:41 GMT

The early theory of actor-critic methods considered convergence using linear function approximators for the policy and value functions. Recent work has established convergence using neural network approximators with a single hidden layer. In this work we are taking the natural next step and establish convergence using deep neural networks with an arbitrary number of hidden layers, thus closing a gap between theory and practice. We show that actor-critic updates projected on a ball around the initial condition will converge to a neighborhood where the average of the squared gradients is O (1 / m) + O (ϵ), with m being the width of the neural network and ϵ the approximation quality of the best critic neural network over the projected set.

artificial intelligence, machine learning, min 2, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

report the final policy performance (mean std) over the seeds. Due to space constraints, we omit the learning curves

Neural Information Processing SystemsNov-13-2025, 07:53:32 GMT

We thank all the reviewers for their constructive feedback on improving the paper. Q. Are exploration and credit assignment (due to delayed rewards) the same? We agree that it's important to clarify this distinction and We'll include this in the revision. Q. Unintended output in provided Q. IRCR if there are indeed dense rewards? We have added a distributional variant of SAC (EXP .

artificial intelligence, guidance reward, machine learning, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.41)
Information Technology > Artificial Intelligence > Robots (0.31)

Add feedback

Distribution-Based Feature Attribution for Explaining the Predictions of Any Classifier

Li, Xinpeng, Ting, Kai Ming

arXiv.org Artificial IntelligenceNov-13-2025

The proliferation of complex, black-box AI models has intensified the need for techniques that can explain their decisions. Feature attribution methods have become a popular solution for providing post-hoc explanations, yet the field has historically lacked a formal problem definition. This paper addresses this gap by introducing a formal definition for the problem of feature attribution, which stipulates that explanations be supported by an underlying probability distribution represented by the given dataset. Our analysis reveals that many existing model-agnostic methods fail to meet this criterion, while even those that do often possess other limitations. To overcome these challenges, we propose Distributional Feature Attribution eXplanations (DFAX), a novel, model-agnostic method for feature attribution. DFAX is the first feature attribution method to explain classifier predictions directly based on the data distribution. We show through extensive experiments that DFAX is more effective and efficient than state-of-the-art baselines.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2511.09332

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
South America > Paraguay > Asunción > Asunción (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report > Promising Solution (0.48)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)

Add feedback

GenDexHand: Generative Simulation for Dexterous Hands

Chen, Feng, Xu, Zhuxiu, Chu, Tianzhe, Zhou, Xunzhe, Sun, Li, Wu, Zewen, Gao, Shenghua, Li, Zhongyu, Yang, Yanchao, Ma, Yi

arXiv.org Artificial IntelligenceNov-4-2025

Data scarcity remains a fundamental bottleneck for embodied intelligence. Existing approaches use large language models (LLMs) to automate gripper-based simulation generation, but they transfer poorly to dexterous manipulation, which demands more specialized environment design. Meanwhile, dexterous manipulation tasks are inherently more difficult due to their higher degrees of freedom. Massively generating feasible and trainable dexterous hand tasks remains an open challenge. To this end, we present GenDexHand, a generative simulation pipeline that autonomously produces diverse robotic tasks and environments for dexterous manipulation. GenDexHand introduces a closed-loop refinement process that adjusts object placements and scales based on vision-language model (VLM) feedback, substantially improving the average quality of generated environments. Each task is further decomposed into sub-tasks to enable sequential reinforcement learning, reducing training time and increasing success rates. Our work provides a viable path toward scalable training of diverse dexterous hand behaviors in embodied intelligence by offering a simulation-based solution to synthetic data generation. Our website: https://winniechen2002.github.io/GenDexHand/.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2511.01791

Country: