TowardsCrowdsourcedTrainingofLargeNeural NetworksusingDecentralizedMixture-of-Experts

Neural Information Processing Systems 

Many recent breakthroughs in deep learning were achieved by training increasingly larger models on massivedatasets. However,training such models can be prohibitively expensive. For instance, the cluster used to train GPT-3 costs over $250 million2. Asaresult, most researchers cannot afford totrain state oftheart models and contribute to their development.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found