Rethinking Multinomial Logistic Mixture of Experts with Sigmoid Gating Function

Open in new window