GhostNetV2: Enhance Cheap Operation with Long-Range Attention

Oct-10-2024, 19:28:22 GMT–Neural Information Processing Systems

Lightweight convolutional neural networks (CNNs) are designed for fast inference on mobile devices. A convolution captures only local information within its window, which limits further performance gains. Adding self-attention to convolution captures global information well, but it substantially slows actual inference. In this paper, we propose a hardware-friendly attention mechanism (dubbed DFC attention) and then present a new GhostNetV2 architecture for mobile applications. The proposed DFC attention is built from fully-connected layers, so it both runs fast on common hardware and captures dependencies between distant pixels.
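As a rough illustration of how attention built from fully-connected layers can relate distant pixels, here is a minimal NumPy sketch. It assumes the fully-connected aggregation is decoupled into a vertical pass followed by a horizontal pass, with one weight matrix per direction shared across channels. The function name `dfc_attention_sketch`, the weight shapes, and the sigmoid gating are illustrative assumptions, not details taken from the abstract, and the paper's actual layers may differ.

```python
import numpy as np


def dfc_attention_sketch(z, f_h, f_w):
    """Hypothetical decoupled fully-connected attention map.

    z   : feature map of shape (H, W, C)
    f_h : (H, H) weights mixing positions along the vertical axis (assumed shape)
    f_w : (W, W) weights mixing positions along the horizontal axis (assumed shape)
    """
    # Vertical pass: every position aggregates all rows of its own column.
    a_vertical = np.einsum("hk,kwc->hwc", f_h, z)
    # Horizontal pass: every position aggregates all columns of its own row.
    # After both passes, each output depends on every input position.
    a = np.einsum("wk,hkc->hwc", f_w, a_vertical)
    # Squash to (0, 1) so the map can gate the features (assumed gating).
    return 1.0 / (1.0 + np.exp(-a))


rng = np.random.default_rng(0)
H, W, C = 4, 5, 3
features = rng.normal(size=(H, W, C))
f_h = rng.normal(size=(H, H))
f_w = rng.normal(size=(W, W))

attention = dfc_attention_sketch(features, f_h, f_w)
reweighted = features * attention  # element-wise gating of the feature map
```

The two passes cost on the order of H + W multiply-adds per output value, rather than the H × W a single dense layer over all positions would need, which is one plausible reading of why such a design can stay fast on common hardware.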

enhance cheap operation, ghostnetv2, long-range attention

Technology:
- Information Technology
  - Communications > Mobile (0.62)
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (0.42)