AITopics | arg

Collaborating Authors

arg

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ed73c36e771881b232ef35fa3a1dec14-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsApr-30-2026, 05:23:45 GMT

artificial intelligence, climatelearn, machine learning, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

a815fe7cad6af20a6c118f2072a881d2-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 05:23:15 GMT

Admittedly, NP variants can be applied aseries of downstream tasks. Our selection of benchmark missions is based on existing literature for NP models.

artificial intelligence, machine learning, module, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.96)

Add feedback

SupplementaryMaterialfor HandMeThat: Human-RobotCommunication inPhysicalandSocialEnvironments

Neural Information Processing SystemsFeb-8-2026, 20:52:56 GMT

In Section B, we summarize the statistics of the dataset. A.1 ObjectSpace Recall that HandMeThat uses an object-centric representation for states. Object hierarchy.HandMeThat classifies all categories into 5classes: location, receptacle, food, tool,andthing. Each class (except for"location") iscomposed ofmultiple subclasses, and each subclass contains several object categories. Intotal, there are155 object categories.

artificial intelligence, loc-location action, refrigerator, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

18ddfb199d71a8a24f83abc1ced077b7-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 17:04:40 GMT

artificial intelligence, encoder, subgoal, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.95)

Add feedback

Hybrid Differential Reward: Combining Temporal Difference and Action Gradients for Efficient Multi-Agent Reinforcement Learning in Cooperative Driving

Han, Ye, Zhang, Lijun, Meng, Dejian, Zhang, Zhuang

arXiv.org Artificial IntelligenceNov-24-2025

In multi-vehicle cooperative driving tasks involving high-frequency continuous control, traditional state-based reward functions suffer from the issue of vanishing reward differences. This phenomenon results in a low signal-to-noise ratio (SNR) for policy gradients, significantly hindering algorithm convergence and performance improvement. To address this challenge, this paper proposes a novel Hybrid Differential Reward (HDR) mechanism. We first theoretically elucidate how the temporal quasi-steady nature of traffic states and the physical proximity of actions lead to the failure of traditional reward signals. Building on this analysis, the HDR framework innovatively integrates two complementary components: (1) a Temporal Difference Reward (TRD) based on a global potential function, which utilizes the evolutionary trend of potential energy to ensure optimal policy invariance and consistency with long-term objectives; and (2) an Action Gradient Reward (ARG), which directly measures the marginal utility of actions to provide a local guidance signal with a high SNR. Furthermore, we formulate the cooperative driving problem as a Multi-Agent Partially Observable Markov Game (POMDPG) with a time-varying agent set and provide a complete instantiation scheme for HDR within this framework. Extensive experiments conducted using both online planning (MCTS) and Multi-Agent Reinforcement Learning (QMIX, MAPPO, MADDPG) algorithms demonstrate that the HDR mechanism significantly improves convergence speed and policy stability. The results confirm that HDR guides agents to learn high-quality cooperative policies that effectively balance traffic efficiency and safety.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2511.16916

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.48)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback

Supplementary Material for HandMeThat: Human-Robot Communication in Physical and Social Environments Y anming Wan

Neural Information Processing SystemsAug-14-2025, 18:21:31 GMT

In Section A, we provide the detailed information for HandMeThat data generation and its textual interface. In Section B, we summarize the statistics of the dataset. Recall that HandMeThat uses an object-centric representation for states. "Location" consists of all non-movable entities. Each class (except for "location") is composed of multiple subclasses, and each subclass contains In total, there are 155 object categories. Each object category is also associated with several attributes.

agent, category, dataset, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Workflow (0.67)

Industry: Consumer Products & Services (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.41)

Add feedback

ADL: A Declarative Language for Agent-Based Chatbots

Zeng, Sirui, Yan, Xifeng

arXiv.org Artificial IntelligenceJul-29-2025

There are numerous frameworks capable of creating and orchestrating agents to address complex tasks. However, most of them highly coupled Python programming with agent declaration, making it hard for maintenance and runtime optimization. In this work, we introduce ADL, an agent declarative language for customer service chatbots. ADL abstracts away implementation details, offering a declarative way to define agents and their interactions, which could ease maintenance and debugging. It also incorporates natural language programming at its core to simplify the specification and communication of chatbot designs. ADL includes four basic types of agents and supports integration with custom functions, tool use, and third-party agents. MICA, a multi-agent system designed to interpret and execute ADL programs, has been developed and is now available as an open-source project at https://github.com/Mica-labs/MICA. Its documentation can be found at https://mica-labs.github.io/.

agent, artificial intelligence, natural language, (17 more...)

arXiv.org Artificial Intelligence

2504.14787

Country: North America > United States (0.46)

Genre:

Research Report (0.50)
Workflow (0.47)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.47)
Banking & Finance (0.46)
Retail (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

OkadaTorch: A Differentiable Programming of Okada Model to Calculate Displacements and Strains from Fault Parameters

Someya, Masayoshi, Yamada, Taisuke, Okazaki, Tomohisa

arXiv.org Artificial IntelligenceJul-24-2025

The Okada model is a widely used analytical solution for displacements and strains caused by a point or rectangular dislocation source in a 3D elastic half-space. We present OkadaTorch, a PyTorch implementation of the Okada model, where the entire code is differentiable; gradients with respect to input can be easily computed using automatic differentiation (AD). Our work consists of two components: a direct translation of the original Okada model into PyTorch, and a convenient wrapper interface for efficiently computing gradients and Hessians with respect to either observation station coordinates or fault parameters. This differentiable framework is well suited for fault parameter inversion, including gradient-based optimization, Bayesian inference, and integration with scientific machine learning (SciML) models. Our code is available here: https://github.com/msomeya1/OkadaTorch

artificial intelligence, fault parameter, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2507.17126

Country: Asia > Japan > Honshū > Tōhoku (0.14)

Genre: Research Report (0.64)

Industry: Energy (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Filters

Collaborating Authors

arg

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

ed73c36e771881b232ef35fa3a1dec14-Supplemental-Datasets_and_Benchmarks.pdf

ed73c36e771881b232ef35fa3a1dec14-Supplemental-Datasets_and_Benchmarks.pdf

a815fe7cad6af20a6c118f2072a881d2-Supplemental-Conference.pdf

SupplementaryMaterialfor HandMeThat: Human-RobotCommunication inPhysicalandSocialEnvironments

18ddfb199d71a8a24f83abc1ced077b7-Supplemental-Conference.pdf

Hybrid Differential Reward: Combining Temporal Difference and Action Gradients for Efficient Multi-Agent Reinforcement Learning in Cooperative Driving

b9432d0f94275f0571c6cc99cf8b1664-Supplemental-Conference.pdf

Supplementary Material for HandMeThat: Human-Robot Communication in Physical and Social Environments Y anming Wan

ADL: A Declarative Language for Agent-Based Chatbots

OkadaTorch: A Differentiable Programming of Okada Model to Calculate Displacements and Strains from Fault Parameters