Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data