AI Is Using Your Likes to Get Inside Your Head

Apr-29-2025, 11:00:00 GMT–WIRED

What is the future of the like button in the age of artificial intelligence? Max Levchin, the PayPal cofounder and Affirm CEO, sees a new and hugely valuable role for liking data: training AI to arrive at conclusions more in line with those a human decision-maker would make. It's a well-known quandary in machine learning that a computer presented with a clear reward function will engage in relentless reinforcement learning to improve its performance and maximize that reward, but that this optimization path often leads AI systems to very different outcomes than would result from humans exercising human judgment. To introduce a corrective force, AI developers frequently use what is called reinforcement learning from human feedback (RLHF). Essentially, they put a human thumb on the scale as the computer arrives at its model, by training it on data reflecting real people's actual preferences.

News Web Page
