AITopics | nonstationary reinforcement learning

An Environment Model for Nonstationary Reinforcement Learning

Neural Information Processing Systems, Apr-6-2023, 17:22:36 GMT

Reinforcement learning in nonstationary environments is generally regarded as an important and yet difficult problem. This paper partially addresses the problem by formalizing a subclass of nonstationary environments. The environment model, called hidden-mode Markov decision process (HM-MDP), assumes that environmental changes are always confined to a small number of hidden modes. While HM-MDP is a special case of partially observable Markov decision processes (POMDP), modeling an HM-MDP environment via the more general POMDP model unnecessarily increases the problem complexity. A variant of the Baum-Welch algorithm is developed for model learning requiring less data and time.
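As a rough illustration of the hidden-mode idea in the abstract, the sketch below treats an environment as a handful of ordinary MDPs ("modes") that share one state and action space, with the active mode hidden from the agent. It assumes that modes switch according to a Markov chain that ignores the agent's actions; the abstract does not spell this out. The class name `HiddenModeMDP` and all of its parameters are hypothetical and are not taken from the paper.

```python
import random


class HiddenModeMDP:
    """Toy hidden-mode MDP: several MDPs over one shared state/action space.

    Assumption (not stated in the abstract): the hidden mode evolves as a
    Markov chain that does not depend on the agent's actions.
    """

    def __init__(self, mode_trans, state_trans, rewards, seed=0):
        # mode_trans[m][m2]: probability of switching from mode m to mode m2
        # state_trans[m][s][a][s2]: probability of s -> s2 under action a in mode m
        # rewards[m][s][a]: reward for taking action a in state s while in mode m
        self.mode_trans = mode_trans
        self.state_trans = state_trans
        self.rewards = rewards
        self.rng = random.Random(seed)
        self.mode = 0   # hidden from the agent
        self.state = 0  # observed by the agent

    def _sample(self, probs):
        # Draw an index according to the given probability vector.
        return self.rng.choices(range(len(probs)), weights=probs, k=1)[0]

    def step(self, action):
        # Reward and next state depend on the current (hidden) mode.
        reward = self.rewards[self.mode][self.state][action]
        self.state = self._sample(self.state_trans[self.mode][self.state][action])
        # The mode then changes on its own, regardless of the action taken.
        self.mode = self._sample(self.mode_trans[self.mode])
        # Only the state and reward are returned; the mode stays hidden.
        return self.state, reward
```

Because `step` returns only the state and the reward, an agent sees the same state behave differently over time, which is the kind of nonstationarity the abstract describes. A general POMDP representation would have to fold the mode and state into one hidden variable, whereas here only the mode is unobserved.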

environment model, markov decision process, nonstationary reinforcement learning, (2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

An Environment Model for Nonstationary Reinforcement Learning

Choi, Samuel P. M., Yeung, Dit-Yan, Zhang, Nevin Lianwen

Neural Information Processing Systems, Dec-31-2000

Reinforcement learning in nonstationary environments is generally regarded as an important and yet difficult problem. This paper partially addresses the problem by formalizing a subclass of nonstationary environments. The environment model, called hidden-mode Markov decision process (HM-MDP), assumes that environmental changes are always confined to a small number of hidden modes.

algorithm, hidden-mode model, hm-mdp, (13 more...)

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > China > Hong Kong > Kowloon (0.04)

Industry: Transportation (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
