How does Architecture Influence the Base Capabilities

Neural Information Processing Systems 

Unlike existing work focusing on the influence of scale on base capabilities, our work examines the influence of architecture on those. Specifically, our concern is: How does architecture influence the base capabilities of pre-trained language models?

Similar Docs  Excel Report  more

TitleSimilaritySource
None found