Multimodal active speaker detection and virtual cinematography for video conferencing
