AITopics | gigastep

Appendix Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning

Neural Information Processing SystemsApr-24-2026, 04:43:02 GMT

In this section, we train policies for different scenarios to validate that the tasks defined in Gigastep can be solved with multi-agent RL algorithms. In particular, we use multi-agent PPO implemented in JAX. In competitive or adversarial MARL, an objective reward measure is not defined, as the collected reward inherently depends on the relative strength of the opposing agent's policy. Therefore, to measure the training progress, we compare the current policy with previous checkpoints of the same policy at earlier training iterations. Specifically, an improving policy should be able to outperform its previous counterparts.

checkpoint, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.32)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

00ba06ba5c324efdfb068865ca44cf0b-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsApr-24-2026, 04:42:59 GMT

evolutionary algorithm, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts (0.15)

Industry:

Leisure & Entertainment > Games > Computer Games (0.68)
Transportation (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
(3 more...)

Add feedback

Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning

Neural Information Processing SystemsDec-23-2025, 16:37:43 GMT

Multi-agent reinforcement learning (MARL) research is faced with a trade-off: it either uses complex environments requiring large compute resources, which makes it inaccessible to researchers with limited resources, or relies on simpler dynamics for faster execution, which makes the transferability of the results to more realistic tasks challenging. Motivated by these challenges, we present Gigastep, a fully vectorizable, MARL environment implemented in JAX, capable of executing up to one billion environment steps per second on consumer-grade hardware. Its design allows for comprehensive MARL experimentation, including a complex, high-dimensional space defined by 3D dynamics, stochasticity, and partial observations. Gigastep supports both collaborative and adversarial tasks, continuous and discrete action spaces, and provides RGB image and feature vector observations, allowing the evaluation of a wide range of MARL algorithms.

gigastep, name change, second multi-agent reinforcement learning, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

Add feedback

Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning

Neural Information Processing SystemsOct-9-2024, 08:14:16 GMT

Multi-agent reinforcement learning (MARL) research is faced with a trade-off: it either uses complex environments requiring large compute resources, which makes it inaccessible to researchers with limited resources, or relies on simpler dynamics for faster execution, which makes the transferability of the results to more realistic tasks challenging. Motivated by these challenges, we present Gigastep, a fully vectorizable, MARL environment implemented in JAX, capable of executing up to one billion environment steps per second on consumer-grade hardware. Its design allows for comprehensive MARL experimentation, including a complex, high-dimensional space defined by 3D dynamics, stochasticity, and partial observations. Gigastep supports both collaborative and adversarial tasks, continuous and discrete action spaces, and provides RGB image and feature vector observations, allowing the evaluation of a wide range of MARL algorithms.

gigastep, second multi-agent reinforcement learning

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Filters

Collaborating Authors

gigastep

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Appendix Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning

00ba06ba5c324efdfb068865ca44cf0b-Paper-Datasets_and_Benchmarks.pdf

Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning

Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning