Reinforcement Learning with Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation