Towards Event-oriented Long Video Understanding

Open in new window