Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

Open in new window