On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Dereventsov, Anton; Vatsavai, Ranga Raju; Webster, Clayton
arXiv.org Artificial Intelligence
In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic $k$-means algorithm as well as conventional choices of the network architectures and optimization algorithms. Numerical examples demonstrate the efficiency of different RL procedures and are used to illustrate that this technique accelerates the agent's ability to learn and does not restrict the agent's performance.
Dec-24-2021
- Country:
- North America > United States > Tennessee > Knox County > Knoxville (0.04)
- North America > United States > New York > New York County > New York City (0.14)
- Asia > Middle East > Jordan (0.04)
- Genre:
- Research Report (1.00)