Video Event Extraction via Tracking Visual States of Arguments

Open in new window