TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning

Andrei Chertkov, Roman Schutski
