Meta-Reinforcement Learning with Discrete World Models for Adaptive Load Balancing