Real-time Core-Periphery Guided ViT with Smart Data Layout Selection on Mobile Devices
–Neural Information Processing Systems
Mobile devices have become essential enablers for AI applications, particularly in scenarios that require real-time performance. Vision Transformer (ViT) has become a fundamental cornerstone in this regard due to its high accuracy. Recent efforts have been dedicated to developing various transformer architectures that offer im- proved accuracy while reducing the computational requirements. However, existing research primarily focuses on reducing the theoretical computational complexity through methods such as local attention and model pruning, rather than considering realistic performance on mobile hardware. Although these optimizations reduce computational demands, they either introduce additional overheads related to data transformation (e.g., Reshape and Transpose) or irregular computation/data-access patterns.
Neural Information Processing Systems
Mar-17-2025, 14:19:27 GMT
- Technology:
- Information Technology
- Architecture > Real Time Systems (1.00)
- Artificial Intelligence (1.00)
- Communications > Mobile (0.81)
- Information Technology