SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model

Open in new window