High Frequency Residual Learning for Multi-Scale Image Classification

Cheng, Bowen, Xiao, Rong, Wang, Jianfeng, Huang, Thomas, Zhang, Lei

May-7-2019–arXiv.org Machine Learning

We present a novel high frequency residual learning framework, which leads to a highly efficient multi-scale network (MSNet) architecture for mobile and embedded vision problems. The architecture utilizes two networks: a low resolution network to efficiently approximate low frequency components and a high resolution network to learn high frequency residuals by reusing the upsampled low resolution features. With a classifier calibration module, MSNet can dynamically allocate computation resources during inference to achieve a better speed and accuracy trade-off. We evaluate our methods on the challenging ImageNet-1k dataset and observe consistent improvements over different base networks. On ResNet-18 and MobileNet with alpha=1.0, MSNet gains 1.5% accuracy over both architectures without increasing computations. On the more efficient MobileNet with alpha=0.25, our method gains 3.8% accuracy with the same amount of computations.

accuracy, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

May-7-2019

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.04)
- North America > United States
  - Washington > King County
    - Redmond (0.04)
  - Illinois > Champaign County
    - Urbana (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found