CNN Mixture-of-Depths
Cakaj, Rinor, Mehnert, Jens, Yang, Bin
–arXiv.org Artificial Intelligence
We introduce Mixture-of-Depths (MoD) for Convolutional Neural Networks (CNNs), a novel approach that enhances the computational efficiency of CNNs by selectively processing channels based on their relevance to the current prediction. This method optimizes computational resources by dynamically selecting key channels in feature maps for focused processing within the convolutional blocks (Conv-Blocks), while skipping less relevant channels. Unlike conditional computation methods that require dynamic computation graphs, CNN MoD uses a static computation graph with fixed tensor sizes which improve hardware efficiency. It speeds up the training and inference processes without the need for customized CUDA kernels, unique loss functions, or finetuning. CNN MoD either matches the performance of traditional CNNs with reduced inference times, GMACs, and parameters, or exceeds their performance while maintaining similar inference times, GMACs, and parameters. For example, on ImageNet, ResNet86-MoD exceeds the performance of the standard ResNet50 by 0.45% with a 6% speedup on CPU and 5% on GPU. Moreover, ResNet75-MoD achieves the same performance as ResNet50 with a 25% speedup on CPU and 15% on GPU.
arXiv.org Artificial Intelligence
Sep-25-2024
- Country:
- North America
- United States
- Washington > King County
- Seattle (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- Washington > King County
- Puerto Rico > San Juan
- San Juan (0.04)
- Mexico > Quintana Roo
- Cancún (0.04)
- Canada
- Ontario > Toronto (0.14)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- France (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Veneto
- Venice (0.04)
- Germany
- Bavaria > Upper Bavaria
- Munich (0.04)
- Baden-Württemberg > Stuttgart Region
- Stuttgart (0.04)
- Bavaria > Upper Bavaria
- Asia > China
- Guangdong Province > Shenzhen (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- North America
- Genre:
- Research Report > Promising Solution (0.34)
- Overview > Innovation (0.34)
- Industry:
- Automobiles & Trucks (0.55)
- Technology: