Goto

Collaborating Authors

 classifier


Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation

Neural Information Processing Systems

We introduce ALIA (Automated Language-guided Image Augmentation), a method which utilizes large vision and language models to automatically generate natural language descriptions of a dataset's domains and augment the training data via language-guided image editing.



Appendix Figure A.1: Input spikes. A. The input spikes, x

Neural Information Processing Systems

They are 300 Poisson neurons, where the first 100 encode the whisker stimulus, the next 100 encode the auditory cue and the last 100 act as an extra noise source for our model. Out of the 300 neurons, 60 of them are inhibitory (red). The input neurons project unrestrictedly to the whole RSNN. The baseline firing rate of all input neurons is 5 Hz. The whisker stimulus and auditory cue are encoded with an increase of the firing rate for 10 ms, starting 4 ms after the onset of the actual stimuli.