Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

Denton, Remi, Zaremba, Wojciech, Bruna, Joan, LeCun, Yann, Fergus, Rob

Jun-9-2014–arXiv.org Artificial Intelligence

We present techniques for speeding up the test-time evaluation of large convolutional networks designed for object recognition tasks. These models deliver impressive accuracy, but each image evaluation requires millions of floating point operations, making their deployment on smartphones and Internet-scale clusters problematic. The computation is dominated by the convolution operations in the lower layers of the model. We exploit the linear structure present within the convolutional filters to derive approximations that significantly reduce the required computation. Using large state-of-the-art models, we demonstrate speedups of convolutional layers on both CPU and GPU by a factor of 2x, while keeping the accuracy within 1% of the original model.
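The abstract does not spell out the approximations themselves, so the following is only a minimal sketch of the simplest kind of linear structure a filter can have: an exactly rank-1 (separable) 2D filter, which can be applied as two 1D passes instead of one 2D pass. The function names, the Sobel-like filter, and the toy image are all hypothetical choices for illustration. Filters in a trained network would generally be only approximately low rank, so a real method would have to accept some approximation error, which this sketch does not model.

```python
# Hypothetical illustration (not taken from the paper): a rank-1 2D filter
# applied directly and as two cheaper 1D passes, in pure Python.

def correlate2d_valid(image, kernel):
    """Direct 'valid' 2D correlation: k_h * k_w multiplies per output pixel."""
    kh, kw = len(kernel), len(kernel[0])
    oh, ow = len(image) - kh + 1, len(image[0]) - kw + 1
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(ow)]
            for i in range(oh)]

def correlate2d_separable(image, col, row):
    """Same result when kernel = outer(col, row), using roughly
    k_h + k_w multiplies per output pixel."""
    kh, kw = len(col), len(row)
    # Horizontal pass: correlate every image row with the row vector.
    tmp = [[sum(r[j + b] * row[b] for b in range(kw))
            for j in range(len(r) - kw + 1)]
           for r in image]
    # Vertical pass: correlate every column of the result with the column vector.
    return [[sum(tmp[i + a][j] * col[a] for a in range(kh))
             for j in range(len(tmp[0]))]
            for i in range(len(tmp) - kh + 1)]

# Assumed toy data: a Sobel-like filter built as an outer product.
col = [1, 2, 1]
row = [1, 0, -1]
kernel = [[c * r for r in row] for c in col]
image = [[(3 * i + 5 * j) % 7 for j in range(6)] for i in range(5)]

direct = correlate2d_valid(image, kernel)
fast = correlate2d_separable(image, col, row)
```

For a k x k filter the direct pass costs k^2 multiplies per output pixel while the two-pass version costs about 2k, and with integer inputs the two outputs agree exactly.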

approximation, convolutional layer, operation

Country:
- North America > United States > New York (0.04)

Genre:
- Research Report > Promising Solution (0.54)

Technology:
- Information Technology
  - Communications (1.00)
  - Artificial Intelligence
    - Machine Learning > Neural Networks (1.00)
    - Vision (0.88)
