Reinforcement Learning with Quantum Variational Circuits
In its general formulation, reinforcement learning involves an agent that interacts with an environment and tries to maximize the cumulative reward it receives. This setting is often formalized as a Markov Decision Process (MDP).
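The agent-environment loop described here can be sketched in a few lines. This is a minimal illustration, not the method of the paper: the two-state MDP, the names `TRANSITIONS`, `step`, `run_episode`, and `greedy_policy`, and the discount factor `gamma` are all assumptions made up for the example, and no quantum circuit is involved.

```python
# Hypothetical two-state MDP, invented for illustration only.
# TRANSITIONS[state][action] -> (next_state, reward); deterministic for simplicity.
TRANSITIONS = {
    0: {"stay": (0, 0.0), "move": (1, 1.0)},
    1: {"stay": (1, 2.0), "move": (0, 0.0)},
}


def step(state, action):
    """Environment: return (next_state, reward) for the chosen action."""
    return TRANSITIONS[state][action]


def run_episode(policy, start_state=0, horizon=5, gamma=0.9):
    """Roll out a policy and return the discounted sum of rewards."""
    state, total = start_state, 0.0
    for t in range(horizon):
        action = policy(state)               # agent picks an action
        state, reward = step(state, action)  # environment responds
        total += (gamma ** t) * reward       # accumulate discounted reward
    return total


def greedy_policy(state):
    """Hand-written policy: move to state 1, then stay there."""
    return "move" if state == 0 else "stay"
```

Calling `run_episode(greedy_policy)` rolls out the hand-written policy for five steps. In a learning setting, the policy would instead be a parameterized function (for example a neural network or a variational circuit) whose parameters are adjusted to increase this return.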
Aug-20-2020, 06:40:44 GMT