Efficient Identification of Butterfly Sparse Matrix Factorizations

Zheng, Léon, Riccietti, Elisa, Gribonval, Rémi

Nov-7-2022–arXiv.org Artificial Intelligence

Fast transforms correspond to factorizations of the form $\mathbf{Z} = \mathbf{X}^{(1)} \ldots \mathbf{X}^{(J)}$, where each factor $ \mathbf{X}^{(\ell)}$ is sparse and possibly structured. This paper investigates essential uniqueness of such factorizations, i.e., uniqueness up to unavoidable scaling ambiguities. Our main contribution is to prove that any $N \times N$ matrix having the so-called butterfly structure admits an essentially unique factorization into $J$ butterfly factors (where $N = 2^{J}$), and that the factors can be recovered by a hierarchical factorization method, which consists in recursively factorizing the considered matrix into two factors. This hierarchical identifiability property relies on a simple identifiability condition in the two-layer and fixed-support setting. This approach contrasts with existing ones that fit the product of butterfly factors to a given matrix via gradient descent. The proposed method can be applied in particular to retrieve the factorization of the Hadamard or the discrete Fourier transform matrices of size $N=2^J$. Computing such factorizations costs $\mathcal{O}(N^{2})$, which is of the order of dense matrix-vector multiplication, while the obtained factorizations enable fast $\mathcal{O}(N \log N)$ matrix-vector multiplications and have the potential to be applied to compress deep neural networks.

artificial intelligence, data quality, machine learning, (19 more...)

arXiv.org Artificial Intelligence

Nov-7-2022

arXiv.org PDF

Add feedback

Country:
- Europe
  - Switzerland > Basel-City
    - Basel (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
- Asia > Singapore
  - Central Region > Singapore (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology
  - Data Science > Data Quality
    - Data Transformation (0.86)
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (0.48)
    - Statistical Learning > Gradient Descent (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found