Zero Time Waste: Recycling Predictions in Early Exit Neural Networks

Apr-24-2026, 19:54:57 GMT–Neural Information Processing Systems

The problem of reducing processing time of large deep learning models is a fundamental challenge in many real-world applications. Early exit methods strive towards this goal by attaching additional Internal Classifiers (ICs) to intermediate layers of a neural network. ICs can quickly return predictions for easy examples and, as a result, reduce the average inference time of the whole model. However, if a particular IC does not decide to return an answer early, its predictions are discarded, with its computations effectively being wasted. To solve this issue, we introduce Zero Time Waste (ZTW), a novel approach in which each IC reuses predictions returned by its predecessors by (1) adding direct connections between ICs and (2) combining previous outputs in an ensemble-like manner. We conduct extensive experiments across various datasets and architectures to demonstrate that ZTW achieves a significantly better accuracy vs. inference time trade-off than other recently proposed early exit methods.
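The early-exit mechanism and the idea of recycling earlier predictions can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the logits of each IC are already available, that an IC exits when its top-class probability reaches a hypothetical `threshold`, and that "ensemble-like" combination means a plain geometric mean of all IC outputs seen so far. The direct IC-to-IC connections from point (1) are not modelled, and the names `log_softmax` and `early_exit` are illustrative only.

```python
import math


def log_softmax(logits):
    """Numerically stable log-probabilities for one logit vector."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def early_exit(ic_logits, threshold=0.9, recycle=True):
    """Walk through the ICs in order and stop at the first confident one.

    ic_logits: list of logit vectors, one per IC, ordered from shallow to deep.
    threshold: assumed exit criterion on the top-class probability.
    recycle:   if True, combine each IC with all of its predecessors
               (geometric mean, an assumption made for this sketch).
    Returns (exit_index, predicted_class, confidence).
    """
    log_sum = None  # running sum of log-probabilities over the ICs seen so far
    last = len(ic_logits) - 1
    for i, logits in enumerate(ic_logits):
        log_probs = log_softmax(logits)
        if recycle:
            if log_sum is None:
                log_sum = log_probs
            else:
                log_sum = [a + b for a, b in zip(log_sum, log_probs)]
            # Renormalised geometric mean of the i + 1 predictions so far.
            log_probs = log_softmax([s / (i + 1) for s in log_sum])
        probs = [math.exp(lp) for lp in log_probs]
        conf = max(probs)
        # The final classifier always answers, confident or not.
        if conf >= threshold or i == last:
            return i, probs.index(conf), conf
```

With `recycle=False`, the output of any IC that does not exit is simply thrown away, which is the waste the abstract describes. With `recycle=True`, those outputs still influence later decisions.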
