Thinking Preference Optimization