RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs

Open in new window