Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation

Feb-11-2026, 03:11:28 GMT–Neural Information Processing Systems

In the latter, the algorithm estimates the low-rank matrix corresponding to the (state, action) value function of the current policy using the following two-phase procedure.

data mining, machine learning, reinforcement learning

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > Sweden
  - Stockholm > Stockholm (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Technology:
- Information Technology
  - Data Science > Data Mining (0.93)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Reinforcement Learning (0.50)
