Risk-SensitiveReinforcementLearning: Near-OptimalRisk-SampleTradeoffinRegret

Feb-11-2026, 06:27:44 GMT–Neural Information Processing Systems

We study risk-sensitive reinforcement learning in episodic Markov decision processes with unknown transition kernels, where the goal is to optimize the total reward under the risk measure of exponential utility. We propose two provably efficient model-free algorithms, Risk-Sensitive Value Iteration (RSVI) and Risk-Sensitive Q-learning (RSQ). These algorithms implement a form of risk-sensitive optimism in the face of uncertainty, which adapts to both riskseeking and risk-averse modes of exploration.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Feb-11-2026, 06:27:44 GMT

Conferences PDF

Add feedback

Country:
- Asia > Middle East
  - Jordan (0.05)
- Europe > Netherlands
  - South Holland > Delft (0.04)
- North America > Canada
  - British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.48)
  - Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret

Similar Docs Excel Report more

Title	Similarity	Source
None found