Breakdance Video classification in the age of Generative AI
Dhar, Sauptik; Ramakrishnan, Naveen; Munson, Michelle
arXiv.org Artificial Intelligence
Large vision-language models have recently been applied to many sports use cases. Most of this work targets a limited subset of popular sports such as soccer, cricket, and basketball, and focuses on generative tasks such as visual question answering and highlight generation. This work analyzes the applicability of modern video foundation models (both encoder and decoder) to a niche but hugely popular dance sport: breakdance. Our results show that video encoder models continue to outperform state-of-the-art video language models on prediction tasks. We provide insights on how to choose the encoder model, along with a thorough analysis of the workings of a finetuned decoder model for breakdance video classification.
Oct-24-2025
- Country:
  - Asia
    - Middle East > Saudi Arabia > Asir Province > Abha (0.04)
    - Thailand > Bangkok > Bangkok (0.04)
  - North America > United States > California > Alameda County > Berkeley (0.04)
- Genre:
- Research Report > New Finding (0.54)
- Industry:
- Leisure & Entertainment > Sports (0.48)
- Technology: