A Toy Model of Universality: Reverse Engineering How Networks Learn Group Operations

Open in new window