CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks
