5bd529d5b07b647a8863cf71e98d651a-Supplemental.pdf
–Neural Information Processing Systems
Kinetics-400 [1] is a large scale action recognition dataset with trimmed video clips of around 10-second durations. It is collected from realistic YouTube videos, which covers 400 categories of human activities. In total, it contains around240K training videos and20K validation videos. Specifically whentraining Kinetics-200/-400 from scratch, we adopt the cosine schedule of learning rate decaying with an initiallearningrateof0.1. The initial learning rate is 0.005anddecaysby 0.1atepoch20and40.
Neural Information Processing Systems
Feb-8-2026, 20:53:25 GMT
- Technology: