
Collaborating Authors

dreambooth





Subject-driven Text-to-Image Generation via Apprenticeship Learning

Neural Information Processing Systems

Recent text-to-image generation models like DreamBooth have made remarkable progress in generating highly customized images of a target subject by fine-tuning an "expert model" for a given subject from a few examples. However, this process is expensive, since a new expert model must be learned for each subject. In this paper, we present SuTI, a Subject-driven Text-to-Image generator that replaces subject-specific fine-tuning with in-context learning. Given a few demonstrations of a new subject, SuTI can instantly generate novel renditions of the subject in different scenes, without any subject-specific optimization. SuTI is powered by apprenticeship learning, where a single apprentice model is learned from data generated by a massive number of subject-specific expert models. Specifically, we mine millions of image clusters from the Internet, each centered around a specific visual subject. We adopt these clusters to train a massive number of expert models, each specializing in a different subject. The apprentice model SuTI then learns to imitate the behavior of these fine-tuned experts. SuTI can generate high-quality and customized subject-specific images 20x faster than optimization-based SoTA methods. On the challenging DreamBench and DreamBench-v2, our human evaluation shows that SuTI significantly outperforms existing models like InstructPix2Pix, Textual Inversion, Imagic, Prompt2Prompt, Re-Imagen and DreamBooth.
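
The sketch below is a rough, hypothetical rendering of the apprenticeship-learning recipe the abstract describes: fine-tune one expert per mined subject cluster, collect its generations, and use them as imitation targets for a single apprentice model. All helper names are illustrative placeholders, not SuTI's actual code.

```python
# Hypothetical sketch of the apprenticeship-learning data pipeline described above:
# one expert per subject cluster, whose outputs become training targets for the apprentice.

from dataclasses import dataclass
from typing import Callable, List


@dataclass
class SubjectCluster:
    subject_id: str
    demo_images: List[str]   # a few demonstration images of the subject
    prompts: List[str]       # target prompts mined alongside the cluster


@dataclass
class ApprenticeExample:
    demos: List[str]         # in-context demonstrations shown to the apprentice
    prompt: str
    target_image: str        # image produced by the subject-specific expert


def build_apprentice_dataset(
    clusters: List[SubjectCluster],
    finetune_expert: Callable[[List[str]], Callable[[str], str]],
) -> List[ApprenticeExample]:
    """For each cluster, fine-tune an expert on its demos and record the expert's outputs."""
    dataset: List[ApprenticeExample] = []
    for cluster in clusters:
        expert = finetune_expert(cluster.demo_images)  # DreamBooth-style per-subject fine-tuning
        for prompt in cluster.prompts:
            dataset.append(ApprenticeExample(
                demos=cluster.demo_images,
                prompt=prompt,
                target_image=expert(prompt),           # the expert's rendition of the subject
            ))
    return dataset

# The apprentice is then trained to map (demos, prompt) -> target_image in context,
# so a new subject needs no per-subject optimization at inference time.
```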





Generating Synthetic Data via Augmentations for Improved Facial Resemblance in DreamBooth and InstantID

Ulusan, Koray, Kiefer, Benjamin

arXiv.org Artificial Intelligence

Personalizing Stable Diffusion for professional portrait generation from amateur photos faces challenges in maintaining facial resemblance. This paper evaluates the impact of augmentation strategies on two personalization methods: DreamBooth and InstantID. We compare classical augmentations (flipping, cropping, color adjustments) with generative augmentation using InstantID's synthetic images to enrich training data. Using SDXL and a new FaceDistance metric based on FaceNet, we quantitatively assess facial similarity. Results show classical augmentations can cause artifacts harming identity retention, while InstantID improves fidelity when balanced with real images to avoid overfitting. A user study with 97 participants confirms high photorealism and preferences for InstantID's polished look versus DreamBooth's identity accuracy. Our findings inform effective augmentation strategies for personalized text-to-image generation.
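
As a concrete illustration of the classical augmentations the paper compares (flipping, cropping, color adjustments), a minimal torchvision pipeline might look like the sketch below; the specific parameter values are assumptions for illustration, not the settings used in the study.

```python
# Minimal sketch of a classical augmentation pipeline (flips, crops, color adjustments);
# parameter values here are illustrative assumptions.

from torchvision import transforms

classical_augment = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.RandomResizedCrop(size=1024, scale=(0.8, 1.0)),  # SDXL-resolution crops
    transforms.ColorJitter(brightness=0.1, contrast=0.1, saturation=0.1),
])

# Each subject photo would pass through this pipeline before DreamBooth fine-tuning;
# the paper reports that such augmentations can introduce artifacts that hurt identity retention.
```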


Backbone Augmented Training for Adaptations

Park, Jae Wan, Kim, Junhyeok, Jun, Youngjun, Ko, Hyunah, Hwang, Seong Jae

arXiv.org Artificial Intelligence

Adaptations facilitate efficient training of large backbone models, including diffusion models for image generation and transformer-based language models. While various adaptation techniques enhance performance with minimal computational resources, limited adaptation data often leads to challenges in training. To address this, we focus on the enormous amount of backbone data used to pre-train the backbone models. We propose Backbone Augmented Training (BAT), a method that leverages backbone data to augment the adaptation dataset. First, we formulate and prove two key mathematical propositions: one establishes the validity of BAT, while the other identifies a condition under which BAT benefits adaptation. Furthermore, we introduce an advanced data selection scheme that satisfies these propositions and present the ALBAT algorithm to implement this approach. ALBAT efficiently enhances adaptation training in both personalization and language generation tasks with scarce data.
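
A loose sketch of the backbone-augmented-training idea follows, under the assumption that selection is done by embedding similarity: pick backbone pre-training examples closest to the scarce adaptation set and mix them into fine-tuning. This is a generic nearest-neighbour heuristic, not the paper's ALBAT algorithm or its proven selection condition.

```python
# Generic nearest-neighbour selection of backbone examples in embedding space;
# an assumption-laden stand-in for the paper's actual data selection scheme.

import numpy as np


def select_backbone_examples(
    adaptation_emb: np.ndarray,  # (n_adapt, d) embeddings of adaptation samples
    backbone_emb: np.ndarray,    # (n_backbone, d) embeddings of backbone samples
    k: int,
) -> np.ndarray:
    """Return indices of the k backbone samples most similar to the adaptation set."""
    a = adaptation_emb / np.linalg.norm(adaptation_emb, axis=1, keepdims=True)
    b = backbone_emb / np.linalg.norm(backbone_emb, axis=1, keepdims=True)
    sims = (b @ a.T).max(axis=1)  # best cosine similarity to any adaptation sample
    return np.argsort(-sims)[:k]

# The selected backbone examples are then mixed into the adaptation batches during
# fine-tuning, padding out the dataset when adaptation data alone is too scarce.
```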

