How does Architecture Influence the Base Capabilities

Neural Information Processing Systems

Unlike existing work, which focuses on the influence of scale on base capabilities, our work examines the influence of architecture. Specifically, we ask: how does architecture influence the base capabilities of pre-trained language models?