PARL: A Unified Framework for Policy Alignment in Reinforcement Learning