Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
