Efficient Algorithms for Device Placement of DNN Graph Operators
–Neural Information Processing Systems
Modern machine learning workloads use large models, with complex structures, that are very expensive to execute. The devices that execute complex models are becoming increasingly heterogeneous as we see a flourishing of Domain Specific Architectures (DSAs) being offered as hardware accelerators in addition to CPUs.
Neural Information Processing Systems
Dec-24-2025, 11:17:13 GMT
- Technology: