MURKA: Multi-Reward Reinforcement Learning with Knowledge Alignment for Optimization Tasks
