CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization

May-27-2025, 08:22:37 GMT–Neural Information Processing Systems

The high cost of training has become affordable only to big tech companies, while also raising growing concerns about its environmental impact. This paper presents CoMERA, a **Co**mputing- and **M**emory-**E**fficient training method via **R**ank-**A**daptive tensor optimization. CoMERA achieves end-to-end rank-adaptive tensor-compressed training via a multi-objective optimization formulation, and improves the training so that it delivers both a high compression ratio and excellent accuracy. Our optimized numerical computation (e.g., optimized tensorized embedding and tensor-vector contractions) and GPU implementation eliminate part of the run-time overhead of tensorized training on GPU. This yields, for the first time, a 2-3× speedup per training epoch compared with standard training.
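To make "tensor-compressed" and "tensor-vector contractions" concrete, the following is a minimal NumPy sketch of a weight matrix stored in factorized form and applied to a vector without ever materializing the dense matrix. It assumes a tensor-train-matrix layout with fixed ranks; the abstract does not state which tensor format CoMERA uses, and the rank adaptation and GPU optimizations it describes are not modeled here. The function names (`make_tt_cores`, `tt_matvec`, `tt_to_dense`) and the chosen shapes are illustrative, not taken from the paper.

```python
import numpy as np


def make_tt_cores(out_dims, in_dims, ranks, seed=0):
    """Random cores G_k of shape (r_{k-1}, m_k, n_k, r_k), with r_0 = r_d = 1."""
    rng = np.random.default_rng(seed)
    full_ranks = [1, *ranks, 1]
    return [
        rng.standard_normal((full_ranks[k], m, n, full_ranks[k + 1]))
        for k, (m, n) in enumerate(zip(out_dims, in_dims))
    ]


def tt_matvec(cores, x):
    """Compute W @ x by contracting x with one core at a time."""
    in_dims = [core.shape[2] for core in cores]
    # state axes: (output index built so far, current rank, remaining inputs...)
    state = x.reshape(1, 1, *in_dims)
    for core in cores:
        # Contract the rank axis and the next input axis with the core.
        state = np.tensordot(state, core, axes=([1, 2], [0, 2]))
        # Result axes: (outputs, remaining inputs..., m_k, r_k); move the
        # new output and rank axes to the front, then merge the output axes.
        state = np.moveaxis(state, [-2, -1], [1, 2])
        state = state.reshape(state.shape[0] * state.shape[1], *state.shape[2:])
    return state.reshape(-1)


def tt_to_dense(cores):
    """Rebuild the dense matrix (for checking only; defeats the compression)."""
    full = cores[0]
    for core in cores[1:]:
        full = np.einsum("amnr,rpqs->ampnqs", full, core)
        a, m, p, n, q, s = full.shape
        full = full.reshape(a, m * p, n * q, s)
    return full[0, :, :, 0]


# Hypothetical sizes: a 64 x 64 weight factorized as (4*4*4) x (4*4*4), ranks 3.
cores = make_tt_cores(out_dims=(4, 4, 4), in_dims=(4, 4, 4), ranks=(3, 3))
x = np.random.default_rng(1).standard_normal(64)

y = tt_matvec(cores, x)
compressed_params = sum(core.size for core in cores)
dense_params = 64 * 64
print(compressed_params, dense_params)
```

With these illustrative sizes the factorized form stores 240 numbers in place of the 4096 entries of the dense matrix. In this layout the ranks control the trade-off between compression and expressiveness, which is the quantity a rank-adaptive method adjusts during training.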

comera, computing- and memory-efficient training, rank-adaptive tensor optimization, (4 more...)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.67)
  - Representation & Reasoning (0.41)