PCNN: Pattern-based Fine-Grained Regular Pruning towards Optimizing CNN Accelerators
Tan, Zhanhong, Song, Jiebo, Ma, Xiaolong, Tan, Sia-Huat, Chen, Hongyang, Miao, Yuanqing, Wu, Yifu, Ye, Shaokai, Wang, Yanzhi, Li, Dehui, Ma, Kaisheng
Weight pruning is a powerful technique to realize model compression. We propose PCNN, a fine-grained regular 1D pruning method. A novel index format called Sparsity Pattern Mask (SPM) is presented to encode the sparsity in PCNN. Leveraging SPM with limited pruning patterns and non-zero sequences with equal length, PCNN can be efficiently employed in hardware. Evaluated on VGG-16 and ResNet-18, our PCNN achieves the compression rate up to 8.4X with only 0.2% accuracy loss. We also implement a pattern-aware architecture in 55nm process, achieving up to 9.0X speedup and 28.39 TOPS/W efficiency with only 3.1% on-chip memory overhead of indices.
Feb-11-2020
- Country:
- North America
- United States
- Hawaii > Honolulu County
- Honolulu (0.04)
- California
- San Diego County > San Diego (0.04)
- Monterey County > Seaside (0.04)
- Hawaii > Honolulu County
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada > Quebec
- Montreal (0.04)
- United States
- Europe
- France (0.04)
- United Kingdom > Wales
- Swansea (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Veneto
- Venice (0.04)
- Austria > Upper Austria
- Linz (0.04)
- Asia
- South Korea > Seoul
- Seoul (0.04)
- China > Shaanxi Province
- Xi'an (0.04)
- South Korea > Seoul
- North America
- Genre:
- Research Report (0.50)
- Technology: