Combinatorial Reinforcement Learning with Preference Feedback