Glossary of Terminology in Reinforcement Learning
The key distinguishing feature of RL methods is that they learn policies indirectly, by instead learning value functions. RL methods can be constrasted with direct optimization methods, such as genetic algorithms (GA), which attempt to search the policy space directly.
Jan-18-2017, 10:17:49 GMT
- Technology: