Pedestrian Intention Prediction via Vision-Language Foundation Models

Open in new window