Goto

Collaborating Authors

 Media



Large Language Model as Attributed Training Data Generator: A T ale of Diversity and Bias Yue Y u

Neural Information Processing Systems

Large language models (LLMs) have been recently leveraged as training data generators for various natural language processing (NLP) tasks. While previous research has explored different approaches to training models using generated data, they generally rely on simple class-conditional prompts, which may limit the diversity of the generated data and inherit systematic biases of LLM. Thus, we investigate training data generation with diversely attributed prompts (e.g.,




Overleaf Example

Neural Information Processing Systems

Baseline Methods As standard baselines, we first consider zero-shot CLIP (ZS) and vanilla fine-tuning (FT) with contrastive loss. We construct the label map for contrastive loss by regarding all of the samples from a class as positives. A.3 Multi-modal Classification Dataset To evaluate the multi-modal representation learning under video emotional classification, CMU-MOSEI consists of three modalities textual (T), visual (V), and audio (A), and contains 23,453 Y ouTube video clips about diverse movie reviews, and each clip is annotated with ordinal labels ranging from -3 (strong negative) to 3 (strong positive). While MulT learns the joint encoder only with standard classification loss (i.e., cross-entropy loss; Metric For image-text retrieval, we adopt top-1 and top-5 recalls likewise CLIP retrieval setup. Flickr30k for zero-shot transferred image-text retrieval.


A Dataset for Finding Humans Using Room Acoustics Mason Wang 1 Samuel Clarke

Neural Information Processing Systems

A room's acoustic properties are a product of the room's geometry, the objects within the room, and their specific positions. A room's acoustic properties can be characterized by its impulse response (RIR) between a source and listener