Andrej Karpathy releases concise GPT implementation. Why has he bothered to do this: doesn't he work for OpenAI, at least indirectly? [D] [N]
It's nice to see a concise implementation of GPT in PyTorch. Hugging Face's Transformers is excellent, but it is quite difficult to trace: they are constantly building it out with loads of features, so you get lost. His wiki states he works for OpenAI, and Tesla is at least affiliated with OpenAI. It's also very far from the computer vision domain, so why spend the time on an open source implementation and make some guesses about GPT-2/GPT-3? His implementation is easy to follow, which is nice; most reimplementations I see have bugs or are unnecessarily complex.
Aug-18-2020, 17:30:38 GMT