PredictingTrainingTimeWithoutTraining SupplementaryMaterial

Feb-8-2026, 06:05:19 GMT–Neural Information Processing Systems

In both cases we observe that the predicted curve is reasonably close to the actual curve, more so at the beginning of the training (which is expected, sincethelinearapproximation ismorelikelytohold). Point-wise similarity of predicted and observed loss curve. Up to now we focused on prediction error rates (see e.g. We started defining training time as the first time the (smoothed) loss is belowagiventhreshold(whichwethennormalizedw.r.t. In Section 4we suggest that, in the case of MSE loss, itispossible to predict the training time on alargedataset using asubset ofthesamples. However,sinceourtraining time definition measures the time to reach the asymptotic value (which is what is most useful in practice) rather than the time reach an absolute threshold, this does not affect the accuracy of the prediction(seeAppendixC).

artificial intelligence, machine learning, training time, (19 more...)

Neural Information Processing Systems

Feb-8-2026, 06:05:19 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)

Duplicate Docs Excel Report

Title
Predicting Training Time Without Training Supplementary Material

Similar Docs Excel Report more

Title	Similarity	Source
None found