Minimalist Softmax Attention Provably Learns Constrained Boolean Functions
