VideoCapsuleNet: A Simplified Network for Action Detection

Kevin Duarte, Yogesh Rawat, Mubarak Shah

Feb-13-2026, 06:17:56 GMT–Neural Information Processing Systems

Wepropose a 3D capsule network for videos, called VideoCapsuleNet: a unified network for action detection which can jointly perform pixel-wise action segmentation along with action classification. The proposed network is a generalization of capsule network from 2D to 3D, which takes a sequence of video frames as input. The 3D generalization drastically increases the number of capsules in the network, making capsule routing computationally expensive.

artificial intelligence, capsule, machine learning, (15 more...)

Neural Information Processing Systems

Feb-13-2026, 06:17:56 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada > Quebec > Montreal (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)

Duplicate Docs Excel Report

Title
VideoCapsuleNet: A Simplified Network for Action Detection
VideoCapsuleNet: A Simplified Network for Action Detection

Similar Docs Excel Report more

Title	Similarity	Source
None found