A Implementation Details

A.1 Preliminary Study

The basic GPT-2 model