ImitatingLanguagevia ScalableInverseReinforcementLearning

Neural Information Processing Systems 

The majority of language model training builds on imitation learning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found