Efficient FrameworksforGeneralizedLow-Rank MatrixBanditProblems

Neural Information Processing Systems 

As afollow-up work, [26]further released the rank-one restriction on the action feature matrices, andtheyintroduced analgorithm LowGLOC based ontheonline-to-confidenceset conversion [2]for generalized low-rank matrix bandits with O( p (d1+d2)3rT)regret bound.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found