Goto

Collaborating Authors

 Media


Just Add $ 100 More: Augmenting Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem

Neural Information Processing Systems

PGT -Aug involves three key steps: (i) volumetric 3D instance reconstruction using a 2D-to-3D view synthesis model, (ii) object-level domain alignment with LiDAR intensity simulation, and (iii) a hybrid context-aware placement method from ground and map information. We demonstrate the superiority and generality of our method through performance improvements in extensive experiments conducted on popular benchmarks, i.e., nuScenes, KITTI, and Lyft, especially for the datasets with large domain gaps






SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams Kang Chen 1, 2 Shiyan Chen

Neural Information Processing Systems

Supervised methods, while effective on synthetic datasets, suffer from a significant performance decline when applied to real-world datasets, primarily due to data distribution discrepancies.



Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation

Neural Information Processing Systems

Recent advancements in generative models have significantly enhanced their capacity for image generation, enabling a wide range of applications such as image editing, completion and video editing. A specialized area within generative modeling is layout-to-image (L2I) generation, where predefined layouts of objects guide the generative process. In this study, we introduce a novel regional cross-attention module tailored to enrich layout-to-image generation. This module notably improves the representation of layout regions, particularly in scenarios where existing methods struggle with highly complex and detailed textual descriptions. Moreover, while current open-vocabulary L2I methods are trained in an open-set setting, their evaluations often occur in closed-set environments. To bridge this gap, we propose two metrics to assess L2I performance in open-vocabulary scenarios.


Tree of Attacks: Jailbreaking Black-Box LLMs Automatically

Neural Information Processing Systems

While Large Language Models (LLMs) display versatile functionality, they continue to generate harmful, biased, and toxic content, as demonstrated by the prevalence of human-designed jailbreaks .