On the Expressive Power of Mixture-of-Experts for Structured Complex Tasks
