An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training