A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems
