Constructing Efficient Fact-Storing MLPs for Transformers

Open in new window