AITopics | supernet

MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory

Neural Information Processing SystemsMay-1-2026, 02:15:57 GMT

Due to the high price and heavy energy consumption of GPUs, deploying deep models on IoT devices such as microcontrollers makes significant contributions for ecological AI. Conventional methods successfully enable convolutional neural network inference of high resolution images on microcontrollers, while the framework for vision transformers that achieve the state-of-the-art performance in many vision applications still remains unexplored. In this paper, we propose a hardware-algorithm co-optimizations method called MCUFormer to deploy vision transformers on microcontrollers with extremely limited memory, where we jointly design transformer architecture and construct the inference operator library to fit the memory resource constraint. More specifically, we generalize the one-shot network architecture search (NAS) to discover the optimal architecture with highest task performance given the memory budget from the microcontrollers, where we enlarge the existing search space of vision transformers by considering the low-rank decomposition dimensions and patch resolution for memory reduction. For the construction of the inference operator library of vision transformers, we schedule the memory buffer during inference through operator integration, patch embedding decomposition, and token overwriting, allowing the memory buffer to be fully utilized to adapt to the forward pass of the vision transformer. Experimental results demonstrate that our MCUFormer achieves 73.62% top-1 accuracy on ImageNet for image classification with 320KB memory on STM32F746 microcontroller.

artificial intelligence, deep learning, machine learning, (15 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

48e95c45c8217961bf6cd7696d80d238-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 17:46:53 GMT

architecture, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country: Oceania > Australia (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.99)

Add feedback

Searching the Search Space of Vision Transformer

Neural Information Processing SystemsApr-25-2026, 17:46:49 GMT

Vision Transformer has shown great visual representation power in substantial vision tasks such as recognition and detection, and thus been attracting fast-growing efforts on manually designing more effective architectures. In this paper, we propose to use neural architecture search to automate this process, by searching not only the architecture but also the search space. The central idea is to gradually evolve different search dimensions guided by their E-TError computed using a weight-sharing supernet. Moreover, we provide design guidelines of general vision transformers with extensive analysis according to the space searching process, which could promote the understanding of vision transformer. Remarkably, the searched models, named S3 (short for Searching the Search Space), from the searched space achieve superior performance to recently proposed models, such as Swin, DeiT and ViT, when evaluated on ImageNet. The effectiveness of S3 is also illustrated on object detection, semantic segmentation and visual question answering, demonstrating its generality to downstream vision and vision-language tasks. Code and models will be available at here.

artificial intelligence, machine learning, search space, (15 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.88)

Add feedback

08aee6276db142f4b8ac98fb8ee0ed1b-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 13:53:48 GMT

artificial intelligence, machine learning, supernet, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

e0bc6dbcbcc957b2aeadb20c39ba7f05-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 13:58:48 GMT

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.74)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Add feedback

DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization

Neural Information Processing SystemsFeb-17-2026, 06:54:36 GMT

Diffusion models have achieved remarkable progress in the field of image generation due to their outstanding capabilities.

diffusion model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

cf1129594f603fde9e1913d10b7dbf77-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 00:11:04 GMT

architecture, architecture search, international conference, (15 more...)

Neural Information Processing Systems

Country:

Europe > Denmark > North Jutland > Aalborg (0.04)
Asia > China > Heilongjiang Province > Harbin (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.90)

Add feedback

4e839c9c398c58c878a394633b806ccd-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 15:38:40 GMT

architecture, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.72)

Add feedback

SearchingforBetterSpatio-temporalAlignmentin Few-ShotActionRecognition

Neural Information Processing SystemsFeb-10-2026, 11:46:05 GMT

One ofthe most important tasks invideo understanding istounderstand human actions, which is one of the representativetasks for video understanding [54].

artificial intelligence, machine learning, wang, (18 more...)

Neural Information Processing Systems

Country: Asia > China (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.34)

Add feedback

65d90fc6d307590b14e9e1800d4e8eab-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 02:33:01 GMT

We visualize the Pareto frontiers discovered by OSEs in the right subplots of the two figures. The blue lines with square markers show the one-shot scores of the GTPareto frontier, while the orange/green/red lines show the GT scores of the OSPareto frontier.

architecture, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.35)

Add feedback

Filters

Collaborating Authors

supernet

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory

48e95c45c8217961bf6cd7696d80d238-Supplemental.pdf

Searching the Search Space of Vision Transformer

08aee6276db142f4b8ac98fb8ee0ed1b-Paper.pdf

e0bc6dbcbcc957b2aeadb20c39ba7f05-Paper-Conference.pdf

DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization

cf1129594f603fde9e1913d10b7dbf77-Paper-Conference.pdf

4e839c9c398c58c878a394633b806ccd-Paper-Conference.pdf

SearchingforBetterSpatio-temporalAlignmentin Few-ShotActionRecognition

65d90fc6d307590b14e9e1800d4e8eab-Supplemental.pdf