Efficient Rematerialization for Deep Networks

Kumar, Ravi, Purohit, Manish, Svitkina, Zoya, Vee, Erik, Wang, Joshua

Mar-19-2020, 03:01:40 GMT–Neural Information Processing Systems

When training complex neural networks, memory usage can be an important bottleneck. The question of when to rematerialize, i.e., to recompute intermediate values rather than retaining them in memory, becomes critical to achieving the best time and space efficiency. In this work we consider the rematerialization problem and devise efficient algorithms that use structural characterizations of computation graphs (treewidth and pathwidth) to obtain provably efficient rematerialization schedules. Our experiments demonstrate the performance of these algorithms on many common deep learning models.
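To make the time/space trade-off concrete, here is a minimal sketch of rematerialization on the simplest possible computation graph, a chain of n nodes. It is not the treewidth- or pathwidth-based algorithm from the paper; it is the familiar "keep every k-th activation" heuristic, and the function name `simulate_chain`, the unit cost per node, and the unit size per activation are all assumptions made for illustration.

```python
import math


def simulate_chain(n, segment):
    """Simulate a forward and backward pass over a chain of n nodes.

    Assumptions (illustrative only): every node costs one unit to evaluate
    and its activation occupies one unit of memory. Every `segment`-th
    activation is kept as a checkpoint; the rest are dropped after the
    forward pass and recomputed, one segment at a time, when the backward
    pass needs them.

    Returns (peak activations held at once, total forward evaluations).
    """
    # Forward pass: evaluate every node once, retain only the checkpoints.
    checkpoints = [i for i in range(n) if i % segment == 0]
    forward_evals = n
    peak = len(checkpoints)

    # Backward pass: process segments from the last one to the first.
    for start in reversed(checkpoints):
        end = min(start + segment, n)
        # Rematerialize the non-checkpoint activations of this segment.
        recomputed = end - start - 1
        forward_evals += recomputed
        # Checkpoints plus the segment currently being rematerialized.
        peak = max(peak, len(checkpoints) + recomputed)
        # The recomputed activations are freed before the next segment.
    return peak, forward_evals


if __name__ == "__main__":
    n = 100
    print(simulate_chain(n, 1))              # keep everything: (100, 100)
    print(simulate_chain(n, math.isqrt(n)))  # checkpoint every 10th: (19, 190)
```

Under these assumptions, keeping every activation holds 100 values and evaluates each node once, while checkpointing every 10th node holds at most 19 values at the cost of 90 extra evaluations. General computation graphs do not decompose this neatly, which is where structural parameters such as treewidth and pathwidth come in.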

deep learning, efficient rematerialization, neural network


Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)