FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features

Open in new window