BMU-MoCo: BidirectionalMomentumUpdate forContinualVideo-LanguageModeling
–Neural Information Processing Systems
Different from the original MoCo [19] and its cross-modal versions [15, 33, 35] that utilize momentum update for only momentum encoders to maintain a large consistent queue, our BMU strategy imposes momentum update on both momentum encoders and (video/text) encoders.
Neural Information Processing Systems
Feb-10-2026, 17:59:17 GMT
- Technology: