Goto

Collaborating Authors

 annotator




StoryBench: A Multifaceted Benchmark for Continuous Story Visualization

Neural Information Processing Systems

Generating video stories from text prompts is a complex task. In addition to having high visual quality, videos need to realistically adhere to a sequence of text prompts whilst being consistent throughout the frames.



COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs

Neural Information Processing Systems

Despite their demonstrated utility for NLP, multimodal counterfactual examples have been relatively unexplored due to the difficulty of creating paired image-text data with minimal counterfactual changes. To address this challenge, we introduce a scalable framework for automatic generation of counterfactual examples using text-to-image diffusion models.






Supplemental Material - Annotator: A Generic Active Learning Baseline for LiDAR Semantic Segmentation

Neural Information Processing Systems

The data is collected in Peking University and uses the same data format as SemanticKITTI. To ensure all tasks are well-defined, we formalize consistent and compatible semantic class vocabulary across the above datasets, ensuring there is a one-to-one mapping between all semantic classes. As for ASFDA and ADA settings, we have an additional warm-up stage, i.e., the network is Both source and target data have a batch size of 16. Both training loss and validation loss consistently decrease over time, indicating effective model training. We report mIoU results across existing AL approaches in Table A3.