AITopics | Markov Models

Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning

Neural Information Processing SystemsAug-16-2025, 16:47:30 GMT

We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure. Although existing works have established non-asymptotic regret guarantees for this problem, they leave open an exponential gap between the upper and lower bounds. We identify the deficiencies in existing algorithms and their analysis that result in such a gap.

algorithm, bellman equation, equation, (12 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

ab6439fa2daf0246f92eea433bca5ac4-Paper.pdf

Neural Information Processing SystemsAug-16-2025, 16:47:27 GMT

algorithm, bellman equation, exponential bellman equation, (9 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > New York (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.98)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)

Add feedback

Nested Variational Inference

Neural Information Processing SystemsAug-16-2025, 16:46:09 GMT

NVI learns proposals by optimizing a divergence at each level of nesting.

artificial intelligence, machine learning, objective, (16 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > North Holland > Amsterdam (0.04)
Asia > China (0.04)

Genre: Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Sample-Efficient Reinforcement Learning of Undercomplete POMDPs

Neural Information Processing SystemsAug-16-2025, 16:31:14 GMT

In many sequential decision making settings, the agent lacks complete information about the underlying state of the system, a phenomenon known as partial observability .

algorithm, pomdp, probability, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Workflow (0.68)
Research Report > New Finding (0.46)

Industry: Health & Medicine (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.94)

Add feedback

d63fbf8c3173730f82b150c5ef38b8ff-Paper.pdf

Neural Information Processing SystemsAug-16-2025, 15:46:49 GMT

co-occurrence matrix, markov chain, matrix, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China (0.04)

Genre: Research Report (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.58)

Add feedback

To Reviewer 1

Neural Information Processing SystemsAug-16-2025, 15:46:37 GMT

We appreciate your positive feedback and will revise our presentation accordingly. Prior to this work, the walk length of DeepWalk has to be selected by cross-validation. Thank you for your comments. We appreciate your views and we would like to clarify a few points. We are open to reframing the work as "Matrix Thank you for your comments.

co-occurrence matrix, markov chain, reviewer, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.58)

Add feedback