Characterizing the Efficiency of Distributed Training: A Power, Performance, and Thermal Perspective

Open in new window