First Trillion Parameter Model on HuggingFace - Mixture of Experts (MoE)

Nov-19-2022, 04:00:13 GMT–#artificialintelligence

Google AI's Switch Transformers model, a Mixture of Experts (MoE) model, that was released a few months ago is now available on HuggingFace. The model scales up to 1.6 trillion parameters and is now openly accessible. Click here to check out the model on HuggingFace. MoE models are considered to be the next step of Natural Language Processing (NLP) architectures that have highly efficient scalable properties. The architecture is considered similar to the classic T5 model, with a Feed Forward layer getting replaced by a Sparse Feed Forward Layer.

artificial intelligence, huggingface, natural language, (8 more...)

#artificialintelligence

Nov-19-2022, 04:00:13 GMT

News Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found