Goto

Collaborating Authors

 Africa





LanguageModelsareFew-ShotLearners

Neural Information Processing Systems

Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous nonsparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks andfew-shot demonstrations specified purelyviatextinteraction withthemodel.






MoleculeGenerationbyPrincipalSubgraphMining andAssembling

Neural Information Processing Systems

Nevertheless, these methods usually rely on hand-crafted or external subgraph construction, andthesubgraph assembling depends solely onlocal arrangement.