AITopics | rlpyt

Collaborating Authors

rlpyt

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

c6447300d99fdbf4f3f7966295b8b5be-AuthorFeedback.pdf

Neural Information Processing SystemsAug-16-2025, 08:47:29 GMT

assumption, step time, synchronization, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Review for NeurIPS paper: High-Throughput Synchronous Deep RL

Neural Information Processing SystemsMay-31-2025, 19:27:11 GMT

The baselines are somehow weak. Though TorchBeast is a strong baseline, the PPO and A2C from Kostrikov seem weak. As far as I know, faster training is not the goal of Kostrikov's implementation. For PPO, the implementation from OpenAI baselines are stronger, which features parallelization with MPI and all-reduce gradients. For A2C, one could consider rlpyt (rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch), where various sampling schemes (including batch synchronization) and optimization schemes can be used.

batch synchronization, high-throughput synchronous deep rl, implementation, (9 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.57)

Add feedback

Best of arXiv.org for AI, Machine Learning, and Deep Learning – September 2019 - insideBIGDATA

#artificialintelligenceOct-20-2019, 03:13:51 GMT

Researchers from all over the world contribute to this repository as a prelude to the peer review process for publication in traditional journals. We hope to save you some time by picking out articles that represent the most promise for the typical data scientist. The articles listed below represent a fraction of all articles appearing on the preprint server. They are listed in no particular order with a link to each paper along with a brief overview. Especially relevant articles are marked with a "thumbs up" icon.

algorithm, machine learning, triplet constraint, (14 more...)

#artificialintelligence

Genre:

Overview (0.70)
Research Report (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Add feedback

Boston Dynamics Lets the Dogs Out; Google Releases Deepfake Detection Dataset

#artificialintelligenceOct-1-2019, 06:38:39 GMT

Boston Dynamics' Robot Dog Is Now Available for Select Customers Boston Dynamics has begun commercialization of its robodog Spot. The company released a video on Tuesday that shows Spot navigating challenging terrain, picking up construction objects, moving through bad weather, and picking itself up after a fall. Boston Dynamics' Atlas Can Now Do An Impressive Gymnastics Routine Alongside the news that Boston Dynamics is letting robot dog Spot out of its laboratory for the first time, the company has released a new video of Atlas, a spectacular bipedal robot that's previously been seen doing everything from parkour to backflips. Contributing Data to Deepfake Detection Research In collaboration with Jigsaw, Google has announced the release of a large dataset of visual deepfakes they have produced. The data has been incorporated into the Technical University of Munich and the University Federico II of Naples' new FaceForensics benchmark, an effort that Google co-sponsors.

boston dynamic, google release deepfake detection dataset, learning, (12 more...)

#artificialintelligence

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.27)
North America > Canada > Ontario > Toronto (0.17)
North America > United States > New York (0.06)
(3 more...)

Industry: Information Technology > Security & Privacy (0.83)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.80)

Add feedback

rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch

Stooke, Adam, Abbeel, Pieter

arXiv.org Artificial IntelligenceSep-24-2019

Since the recent advent of deep reinforcement learning for game play and simulated robotic control, a multitude of new algorithms have flourished. Most are model-free algorithms which can be categorized into three families: deep Q-learning, policy gradients, and Q-value policy gradients. These have developed along separate lines of research, such that few, if any, code bases incorporate all three kinds. Yet these algorithms share a great depth of common deep reinforcement learning machinery. We are pleased to share rlpyt, which implements all three algorithm families on top of a shared, optimized infrastructure, in a single repository. It contains modular implementations of many common deep RL algorithms in Python using PyTorch, a leading deep learning library. rlpyt is designed as a high-throughput code base for small- to medium-scale research in deep RL. This white paper summarizes its features, algorithms implemented, and relation to prior work, and concludes with detailed implementation and usage notes. rlpyt is available at https://github.com/astooke/rlpyt.

algorithm, arxiv preprint arxiv, rlpyt, (12 more...)

arXiv.org Artificial Intelligence

1909.015

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.40)

Industry: Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback