ASelf-TuningActor-CriticAlgorithm

Neural Information Processing Systems 

The general concept is to represent the training loss as a function of both the agent parameters and the hyperparameters. The agent optimizes the parameters to minimize this loss function, w.r.t the current hyperparameters.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found