Goto

Collaborating Authors

 concretely


ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty

Neural Information Processing Systems

Compositionality is a critical capability in Text-to-Image (T2I) models, as it reflects their ability to understand and combine multiple concepts from text descriptions. Existing evaluations of compositional capability rely heavily on human-designed text prompts or fixed templates, limiting their diversity and complexity, and yielding low discriminative power. We propose ConceptMix, a scalable, controllable, and customizable benchmark which automatically evaluates compositional generation ability of T2I models. This is done in two stages. First, ConceptMix generates the text prompts: concretely, using categories of visual concepts (e.g., objects, colors, shapes, spatial relationships), it randomly samples an object and k-tuples of visual concepts, then uses GPT-4o to generate text prompts for image generation based on these sampled concepts.


SAFE TrainedModels

Neural Information Processing Systems

After calibrating in the first session, the slow efficient tuning parameters can capture more informativefeatures, improving generalization to incoming classes. Moreover, to further incorporate novel concepts, we strikeabalance between stability and plasticity byfixing slowefficient tuning parameters and continuously updating the fast ones. Specifically, a cross-classification loss with feature alignment is proposed to circumvent catastrophic forgetting.



Online Adaptive Methods, Universality and Acceleration

Neural Information Processing Systems

Conversely, adaptive first order methods are very popular in Machine Learning, with AdaGrad, [12],beingthemostprominent methodamongthisclass. AdaGrad isanonlinelearning algorithm which adapts its learning rate using the feedback (gradients) received through the optimization process, and is known to successfully handle noisy feedback.


Appendix

Neural Information Processing Systems

By this way,YoutubeDNN can be compatible with non-sequential recommendation task.SCANN and IPNSW are built on the learned representation of YoutubeDNN.


e8dbeb1c947a30576c699e7f5c73d3e3-Paper-Conference.pdf

Neural Information Processing Systems

However, within this specific application domain, existing VAE methods are restricted by using only one layer of latent variables andstrictly Gaussian posterior approximations.


Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Neural Information Processing Systems

Thiscanbe formulated into anon-linear equality constrained optimization problem. Unlike GANs, solving such problem iscomputationally challenging, wethen proposed a simple yet effective procedure to decouple the alternating updates for the two networks for stability. By teaching the perturbation generator to hijacking the training trajectory of the victim classifier, the generator can thus learn to move against thevictim classifier stepbystep.


cf9dc5e4e194fc21f397b4cac9cc3ae9-Paper.pdf

Neural Information Processing Systems

However, the structure of their hidden layer representations is only theoretically well-understood incertain infinite-width limits, inwhichtheserepresentations cannot flexibly adapt tolearn data-dependent features [3-11,24]. Inthe Bayesian setting, these representations are described by fixed, deterministic kernels [3-11].