Bias-Variance Trade-off and Overlearning in Dynamic Decision Problems

Nov-18-2020–arXiv.org Machine Learning

Recent advances in the training of neural networks make high-dimensional numerical studies feasible for decision problems in uncertain environments. Although reinforcement learning has been widely used in optimal control for several decades [6], only recently have Han and E [18] and Han et al. [20] combined it with Monte Carlo type regression for the off-line construction of optimal feedback actions. In these problems, the randomness and the state are observable, and a training set based on historical or simulated data is readily available. One then approximates the objective functions of these problems by empirical averages over this training data, constructing a loss function that is minimized over the network parameters. The minimizer, or a near-minimizer, is the trained network, and it approximates the optimal feedback action.
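The construction described above (empirical average over training data, loss minimized over the parameters) can be illustrated with a minimal sketch. It is not the authors' setup: the quadratic cost, the one-parameter linear "network" `feedback_action`, the Gaussian data in `simulate`, and the plain gradient descent in `train` are all assumptions chosen to keep the example small. Comparing the loss on the training sample with the loss on a large independent sample gives a rough sense of how far the trained action is fitted to the particular training data.

```python
import random

def simulate(n, seed):
    """Draw n (state, noise) pairs; both are observable in the training data.
    Standard Gaussian data is an assumption made for illustration."""
    rng = random.Random(seed)
    return [(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(n)]

def feedback_action(theta, x):
    """Hypothetical one-parameter 'network': the action depends on the state only."""
    return theta * x

def empirical_loss(theta, data):
    """Empirical average of the assumed quadratic cost (a - (x + z))^2."""
    total = sum((feedback_action(theta, x) - (x + z)) ** 2 for x, z in data)
    return total / len(data)

def train(data, lr=0.1, steps=500):
    """Plain gradient descent on the empirical loss over the parameter theta."""
    theta = 0.0
    for _ in range(steps):
        grad = sum(2.0 * (theta * x - (x + z)) * x for x, z in data) / len(data)
        theta -= lr * grad
    return theta

train_data = simulate(200, seed=0)     # small training set
test_data = simulate(20000, seed=1)    # large independent sample

theta_hat = train(train_data)          # the "trained network"
in_sample = empirical_loss(theta_hat, train_data)
out_of_sample = empirical_loss(theta_hat, test_data)
print(theta_hat, in_sample, out_of_sample)
```

In this toy problem the noise has mean zero and is independent of the state, so the true optimal parameter is 1. The trained `theta_hat` is the minimizer of the empirical loss only, so it generally differs from 1 by a sampling error that shrinks as the training set grows.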

artificial intelligence, machine learning, reppen & soner overlearning

Country:
- North America > United States
  - New Jersey > Mercer County
    - Princeton (0.04)
  - Massachusetts > Suffolk County
    - Boston (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Banking & Finance (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (1.00)
  - Statistical Learning (0.93)
