Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
–Neural Information Processing Systems
As a follow-up work, [26] further relaxed the rank-one restriction on the action feature matrices and introduced an algorithm, LowGLOC, based on the online-to-confidence-set conversion [2] for generalized low-rank matrix bandits, with an $O(\sqrt{(d_1+d_2)^3 rT})$ regret bound.
Feb-10-2026, 03:44:31 GMT