Post: Device Placement with Cross-Entropy Minimization and Proximal Policy Optimization
Yuanxiang Gao, Li Chen, Baochun Li
–Neural Information Processing Systems
Training deep neural networks requires an exorbitant amount of computation resources, including a heterogeneous mix of GPU and CPU devices. It is critical to place operations in a neural network on these devices in an optimal way, so that the training process can complete within the shortest amount of time. The state-of-the-art uses reinforcement learning to learn placement skills by repeatedly performing Monte-Carlo experiments. However, due to its equal treatment of placement samples, we argue that there remains ample room for significant improvements.
Neural Information Processing Systems
Nov-20-2025, 18:23:05 GMT
- Country:
- Asia
- China (0.04)
- Middle East > Jordan (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Asia
- Technology: