FLoC: Facility Location-Based Efficient Visual Token Compression for Long Video Understanding

Open in new window