Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning
