DeepMind now learns from human preferences – just like a toddler

Open in new window