DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators
Hong, Charles; Huang, Qijing; Dinh, Grace; Subedar, Mahesh; Shao, Yakun Sophia
arXiv.org Artificial Intelligence
In hardware design space exploration, it is critical to optimize both hardware parameters and algorithm-to-hardware mappings. Previous work has largely approached this simultaneous optimization problem by exploring the hardware design space and the mapspace independently, even though each is large and highly nonconvex on its own. The resulting combinatorial explosion has created significant difficulties for optimizers. In this paper, we introduce DOSA, which consists of differentiable performance models and a gradient descent-based optimization technique to simultaneously explore both spaces and identify high-performing design points. Experimental results demonstrate that, given a similar number of samples, DOSA outperforms random search and Bayesian optimization by 2.80x and 12.59x, respectively, in improving DNN model energy-delay product. We also demonstrate the modularity and flexibility of DOSA by augmenting our analytical model with a learned model, allowing us to optimize buffer sizes and mappings of a real DNN accelerator and attain a 1.82x improvement in energy-delay product.
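As a rough illustration of the one-loop idea described in the abstract, the sketch below runs plain gradient descent on a toy differentiable energy-delay product (EDP) model, updating one hardware parameter and one mapping parameter together in a single loop. Everything in it is an assumption made for illustration: the `energy` and `delay` formulas, their constants, the parameter names `pes` and `tile`, and the function `one_loop_search` are hypothetical and are not DOSA's performance model or API.

```python
import math

# Toy, made-up cost model (NOT the DOSA analytical model).
# pes  : hardware parameter, e.g. number of processing elements (hypothetical)
# tile : mapping parameter, e.g. a tile size (hypothetical)

def energy(pes, tile):
    # Assumed: energy grows linearly with array size and tile size.
    return 1.0 + pes + tile

def delay(pes, tile):
    # Assumed: compute time falls with more PEs, memory traffic falls with
    # larger tiles, plus a small fixed overhead.
    return 1.0 / pes + 1.0 / tile + 0.01

def edp(pes, tile):
    return energy(pes, tile) * delay(pes, tile)

def edp_grad_log(pes, tile):
    # Analytic gradient of EDP with respect to (log pes, log tile).
    # Chain rule: d/d(log x) = x * d/dx.
    e, d = energy(pes, tile), delay(pes, tile)
    d_log_pes = pes * d - e / pes
    d_log_tile = tile * d - e / tile
    return d_log_pes, d_log_tile

def one_loop_search(pes=1.0, tile=2.0, lr=0.1, steps=2000):
    # Optimize in log space so both parameters stay positive.
    u, v = math.log(pes), math.log(tile)
    for _ in range(steps):
        gu, gv = edp_grad_log(math.exp(u), math.exp(v))
        # Hardware and mapping parameters are updated in the same step.
        u -= lr * gu
        v -= lr * gv
    return math.exp(u), math.exp(v)

if __name__ == "__main__":
    best_pes, best_tile = one_loop_search()
    print(best_pes, best_tile, edp(best_pes, best_tile))
```

For these particular constants the toy model has its minimum at `pes = tile = 10`, so the loop should settle near that point. A real design space has discrete and constrained parameters, which this sketch does not handle.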
Sep-16-2025
- Country:
  - Europe
    - Germany > Bavaria
      - Upper Bavaria > Munich (0.04)
    - Switzerland > Vaud
      - Lausanne (0.04)
  - North America
    - Canada > Ontario
      - Toronto (0.05)
    - United States
      - California
        - Alameda County > Berkeley (0.14)
        - Santa Clara County > Santa Clara (0.04)
      - Massachusetts > Middlesex County
        - Cambridge (0.04)
      - New York > New York County
        - New York City (0.04)
      - Oregon > Washington County
        - Hillsboro (0.04)
- Genre:
  - Research Report > New Finding (0.66)
- Technology: