Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design
Ibrahim Alabdulmohsin, Xiaohua Zhai, Alexander Kolesnikov, Lucas Beyer
