DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains