Exponentially Increasing the Capacity-to-Computation Ratio for Conditional Computation in Deep Learning

Open in new window