BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference
–Neural Information Processing Systems
To address these challenges, we introduce the Block-Level Adaptive STructured (BLAST) matrix, designed to learn and leverage efficient structures prevalent in the weight matrices of linear layers within deep learning models. Compared to existing structured matrices, the BLAST matrix offers substantial flexibility, as it can represent various types of structures that are either learned from data or computed from pre-existing weight matrices.
Neural Information Processing Systems
May-28-2025, 15:49:05 GMT
- Country:
- North America > Canada > Ontario > Toronto (0.14)
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education (0.67)
- Technology: