Dealing with Sparse Rewards Using Graph Neural Networks

Oct-15-2023–arXiv.org Artificial Intelligence

Reinforcement learning is a machine learning paradigm where an artificial agent learns the optimal behavior through interactions with a dynamic environment. Goals and purposes are explained to the agent via a scalar reward signal it receives after each interaction. Throughout the training process, the agent infers the behavior that maximizes cumulative reward, also called the return. To succeed in this task, the agent needs to explore the environment to understand which states and actions yield high rewards. On the other hand, the agent also has to exploit the rewards it has already received to adapt its behavior. This problem is known as the exploration and exploitation trade-off. This work was supported in part on Section 2 by the Strategic Project "Digital Business" within the framework of the Strategic Academic Leadership Program "Priority 2030" at the National University of Science and Technology (NUST) MISiS, in part by the Basic Research Program at the National Research University Higher School of Economics (HSE University), and in part by the Computational Resources of HPC Facilities at HSE University.

learning, reinforcement, reinforcement learning, (13 more...)

arXiv.org Artificial Intelligence

Oct-15-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Philadelphia County > Philadelphia (0.04)
- Europe
  - Portugal (0.04)
  - United Kingdom > England
    - Greater London > London (0.04)
  - Russia
    - Volga Federal District
      - Nizhny Novgorod Oblast > Nizhny Novgorod (0.04)
      - Republic of Tatarstan > Kazan (0.04)
      - Perm Krai > Perm (0.04)
    - Central Federal District > Moscow Oblast
      - Moscow (0.05)
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia > Russia
  - Ural Federal District > Sverdlovsk Oblast > Yekaterinburg (0.04)

Genre:
- Research Report (0.82)

Industry:
- Leisure & Entertainment > Games > Computer Games (0.94)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.94)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found