Appendix A Visual Reinforcement Learning Baselines DrQ: This model-free, off-policy reinforcement learning algorithm, is based on Soft Actor-Critic (SAC) [
–Neural Information Processing Systems
Meanwhile, we utilize the 3D scenes from the Gibson dataset as our map for all experiments. Autonomous driving: We choose the stable version of CARLA 0.9.10 for simulation.
Neural Information Processing Systems
Nov-13-2025, 22:11:02 GMT
- Country:
- Europe > Sweden > Skåne County > Malmö (0.04)
- Industry:
- Transportation > Ground > Road (0.34)
- Technology: