PCNN: Pattern-based Fine-Grained Regular Pruning towards Optimizing CNN Accelerators

Tan, Zhanhong, Song, Jiebo, Ma, Xiaolong, Tan, Sia-Huat, Chen, Hongyang, Miao, Yuanqing, Wu, Yifu, Ye, Shaokai, Wang, Yanzhi, Li, Dehui, Ma, Kaisheng

Feb-11-2020–arXiv.org Machine Learning

Weight pruning is a powerful technique to realize model compression. We propose PCNN, a fine-grained regular 1D pruning method. A novel index format called Sparsity Pattern Mask (SPM) is presented to encode the sparsity in PCNN. Leveraging SPM with limited pruning patterns and non-zero sequences with equal length, PCNN can be efficiently employed in hardware. Evaluated on VGG-16 and ResNet-18, our PCNN achieves the compression rate up to 8.4X with only 0.2% accuracy loss. We also implement a pattern-aware architecture in 55nm process, achieving up to 9.0X speedup and 28.39 TOPS/W efficiency with only 3.1% on-chip memory overhead of indices.

artificial intelligence, machine learning, pruning, (19 more...)

arXiv.org Machine Learning

Feb-11-2020

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - California
      - San Diego County > San Diego (0.04)
      - Monterey County > Seaside (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - France (0.04)
  - United Kingdom > Wales
    - Swansea (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Veneto
    - Venice (0.04)
  - Austria > Upper Austria
    - Linz (0.04)
- Asia
  - South Korea > Seoul
    - Seoul (0.04)
  - China > Shaanxi Province
    - Xi'an (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found