Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up