CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks