216f44e2d28d4e175a194492bde9148f-Paper.pdf

Feb-7-2026, 19:03:33 GMT–Neural Information Processing Systems

We assume the environment is modeled as a discrete-time factored-action MDP (FA-MDP) $M = \langle S, A, P, R, \gamma \rangle$, where $S$ is the set of states $s$, $A$ is the set of vector-represented actions $a = (a_1, \dots, a_m)$, $P(s' \mid s, a) = \Pr(s_{t+1} = s' \mid s_t = s, a_t = a)$ is the transition probability, $R(s, a) \in \mathbb{R}$ is the immediate reward for taking action $a$ in state $s$, and $\gamma \in [0, 1)$ is the discount factor.
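
The FA-MDP tuple described above can be sketched as a small data structure. This is only an illustrative sketch, not the authors' implementation: the class name FactoredActionMDP, the field action_dims, the helper discounted_return, and the three-state toy environment are all hypothetical names and assumptions introduced here. The snippet also assumes finite state and action sets, which the text does not require.

```python
import itertools
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

State = int
Action = Tuple[int, ...]  # vector-represented action a = (a_1, ..., a_m)


@dataclass
class FactoredActionMDP:
    """Hypothetical container for the tuple (S, A, P, R, gamma)."""
    states: List[State]                                        # S
    action_dims: List[int]                                     # choices per action component (assumption)
    transition: Callable[[State, Action], Dict[State, float]]  # P(. | s, a)
    reward: Callable[[State, Action], float]                   # R(s, a)
    gamma: float                                               # discount factor in [0, 1)

    def actions(self) -> List[Action]:
        # A is the Cartesian product of the m per-component choice sets.
        return list(itertools.product(*(range(n) for n in self.action_dims)))

    def step(self, s: State, a: Action, rng: random.Random) -> Tuple[State, float]:
        # Sample s' ~ P(. | s, a) and return it with the immediate reward R(s, a).
        dist = self.transition(s, a)
        next_states = list(dist)
        weights = [dist[x] for x in next_states]
        s_next = rng.choices(next_states, weights=weights)[0]
        return s_next, self.reward(s, a)


def discounted_return(mdp: FactoredActionMDP, s0: State,
                      action_seq: List[Action], rng: random.Random) -> float:
    # Accumulate sum_t gamma^t * R(s_t, a_t) along one rollout.
    total, discount, s = 0.0, 1.0, s0
    for a in action_seq:
        s, r = mdp.step(s, a, rng)
        total += discount * r
        discount *= mdp.gamma
    return total


# Toy environment (assumption): 3 states on a ring, m = 2 binary action components.
def toy_transition(s: State, a: Action) -> Dict[State, float]:
    return {(s + sum(a)) % 3: 1.0}  # deterministic move by a_1 + a_2


def toy_reward(s: State, a: Action) -> float:
    return 1.0 if (s + sum(a)) % 3 == 0 else 0.0  # reward for landing on state 0


mdp = FactoredActionMDP(states=[0, 1, 2], action_dims=[2, 2],
                        transition=toy_transition, reward=toy_reward, gamma=0.9)
```

For example, calling discounted_return(mdp, 0, [(1, 1), (1, 0)], random.Random(0)) rolls the toy environment forward for two steps and accumulates the discounted rewards along the way.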

action persistence, machine learning, reinforcement learning


Country:
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning (0.88)


Title
Reinforcement Learning for Control with Multiple Frequencies. Jongmin Lee, Byung-Jun Lee
