Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations

Open in new window