The geometry of hidden representations of large transformer models

Neural Information Processing Systems 

By analyzing the intrinsic dimension (ID) and neighbor composition, we find that the representations evolve similarly in transformers trained on protein language tasks and image reconstruction tasks.
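To make the notion of intrinsic dimension concrete, here is a minimal sketch of one way to estimate it from a set of representation vectors. The summary above does not say which estimator was used, so the TwoNN-style maximum-likelihood estimator below, the function name `two_nn_id`, and the brute-force distance computation are all assumptions chosen for illustration. The idea is that the ratio of each point's second to first nearest-neighbor distance depends on the dimension of the manifold the data lie on, not on the dimension of the space they are embedded in.

```python
import numpy as np

def two_nn_id(x):
    """Estimate intrinsic dimension from nearest-neighbor distance ratios.

    Hypothetical helper (not from the source): a TwoNN-style estimate,
    d = N / sum(log(r2 / r1)), where r1 and r2 are each point's distances
    to its first and second nearest neighbors.
    """
    x = np.asarray(x, dtype=float)
    # Brute-force pairwise Euclidean distances; O(N^2 * D) memory,
    # so this is only suitable for a few thousand points.
    dist = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=-1)
    np.fill_diagonal(dist, np.inf)  # exclude each point from its own neighbors
    # After partitioning, columns 0 and 1 hold the two smallest distances per row.
    nearest = np.partition(dist, 1, axis=1)[:, :2]
    r1, r2 = nearest[:, 0], nearest[:, 1]
    valid = r1 > 0  # drop exact duplicates, which would give a zero denominator
    mu = r2[valid] / r1[valid]
    return mu.size / np.sum(np.log(mu))

# Usage: points on a 2-dimensional plane embedded in a 10-dimensional space.
rng = np.random.default_rng(0)
latent = rng.uniform(size=(500, 2))
embedding = rng.normal(size=(2, 10))
points = latent @ embedding
estimate = two_nn_id(points)
```

In this toy setup the estimate should land near 2 rather than 10, because the points only vary along two directions. Applying the same function to the activations of each layer in turn would give a per-layer ID profile of the kind the summary refers to.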
