MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks
