ef8b5fcc338e003145ac9c134754db71-AuthorFeedback.pdf

Neural Information Processing Systems 

Inthiswork,we1 propose thefirstfinite-time system identification algorithm forpartiallyobservable linear dynamical systems (LDS)2 inadaptive and closed-loop settings. Prior estimation methods only work when the actions/controls are iidrandom3 noise and do not allow for any exploitation or strategic exploration. Our proposed estimation5 algorithm allows the data collection with an adaptive controller and the design of fully adaptive RL methods. We6 believe this contribution alone has a great interest in both RL and control communities. Note that prior works in this area, such as [6,12,37,40-46] have been published in recent machine learning21 conferences(NeurIPS,ICML...).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found