FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion