A.1 in Spark Streaming
–Neural Information Processing Systems
In Figure A1, we show the high-level pseudocode of our port of the PPO algorithm to Spark Streaming. Similar to our port of RLlib to RLlib Flow, we only changed the parts of the PPO algorithm in RLlib that affect distributed execution, keeping the core algorithm implementation (e.g., numerical definition of policy loss and neural networks in TensorFlow) as Figure A1: Example of Spark Streaming for Distributed RL. We conduct comparisons between the performance of both implementations. Experiments here are conducted on A WS m4.10xlarge instances. Looping operations are not well supported.
Neural Information Processing Systems
Oct-3-2025, 03:13:35 GMT
- Technology: