Temporal Preference Optimization for Long-Form Video Understanding