How Low Can We Go: Trading Memory for Error in Low-Precision Training

Yang, Chengrun, Wu, Ziyang, Chee, Jerry, De Sa, Christopher, Udell, Madeleine

Jun-17-2021–arXiv.org Artificial Intelligence

However, we pay a price for the savings: lower precision may yield larger round-off error and hence larger prediction error. As applications proliferate, users must choose which precision to use to train a new model, and chip manufacturers must decide which precisions to manufacture. We view these precision choices as a hyperparameter tuning problem, and borrow ideas from meta-learning to learn the tradeoff between memory and error. In this paper, we introduce Pareto Estimation to Pick the Perfect Precision (PEPPP). We use matrix factorization to find non-dominated configurations (the Pareto frontier) with a limited number of network evaluations. For any given memory budget, the precision that minimizes error is a point on this frontier. Practitioners can use the frontier to trade memory for error and choose the best precision for their goals.

cifar100, configuration, dataset, (13 more...)

arXiv.org Artificial Intelligence

Jun-17-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > Tompkins County > Ithaca (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found