Doubly Sparse: Sparse Mixture of Sparse Experts for Efficient Softmax Inference

Open in new window