94c28dcfc97557df0df6d1f7222fc384-Paper.pdf

Neural Information Processing Systems

However, most of these models do not support the other crucial ability of a generative model: generating imaginary observations by learning the density of the observed data. Although this ability to imagine according to the density of the possible worlds plays a crucial role, e.g., in world models required for planning and model-based reinforcement learning.


Security News This Week: ICE Can Now Spy on Every Phone in Your Neighborhood

WIRED

Plus: Iran shuts down its internet amid sweeping protests, an alleged scam boss gets extradited to China, and more. After a federal agent shot and killed 37-year-old Renee Good in Minneapolis on Wednesday, WIRED surfaced December federal court testimony from the reported ICE shooter, Jonathan Ross. In it, he said he was a firearms trainer and that he has had "hundreds" of encounters with drivers in a professional capacity during enforcement actions. Separately, we looked at how the tactics behind protest policing are moving toward intentional antagonism. If you haven't seen it, here's our guide to protesting safely in the age of surveillance.


PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph

Neural Information Processing Systems

Despite some exciting progress on high-quality image generation from structured (scene graphs) or free-form (sentences) descriptions, most such methods only guarantee image-level semantic consistency, i.e., the generated image matching the semantic meaning of the description. They still lack the ability to synthesize images in a more controllable way, such as finely manipulating the visual appearance of every object. Therefore, to generate images with preferred objects and rich interactions, we propose a semi-parametric method, PasteGAN, for generating an image from a scene graph and image crops, where the spatial arrangements of the objects and their pair-wise relationships are defined by the scene graph and the object appearances are determined by the given object crops. To enhance the interactions of the objects in the output, we design a Crop Refining Network and an Object-Image Fuser to embed the objects as well as their relationships into one map. Multiple losses work collaboratively to guarantee that the generated images highly respect the crops and comply with the scene graphs while maintaining excellent image quality. If the crops are not provided, a crop selector picks the most compatible crops from our external object tank by encoding the interactions around the objects in the scene graph. Evaluated on the Visual Genome and COCO-Stuff datasets, our proposed method significantly outperforms the SOTA methods on Inception Score, Diversity Score and Fréchet Inception Distance. Extensive experiments also demonstrate our method's ability to generate complex and diverse images with given objects.
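The crop selector described above retrieves compatible crops by comparing embeddings. A minimal sketch of that retrieval step, assuming cosine similarity between a scene-graph object embedding and candidate crop embeddings (function names and the similarity choice are illustrative, not the paper's exact implementation):

```python
import numpy as np

def select_crop(object_embedding, crop_embeddings):
    """Return the index of the crop whose embedding is most similar
    (cosine similarity) to the scene-graph object embedding.

    object_embedding: shape (d,)
    crop_embeddings:  shape (n_crops, d), one row per candidate crop
    """
    obj = object_embedding / np.linalg.norm(object_embedding)
    crops = crop_embeddings / np.linalg.norm(
        crop_embeddings, axis=1, keepdims=True
    )
    scores = crops @ obj  # cosine similarity against each candidate
    return int(np.argmax(scores))
```

In the paper the object embedding would additionally encode the interactions around the object in the scene graph, so the selected crop fits its relational context rather than just its category.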



AI firm wins high court ruling after photo agency's copyright claim

The Guardian

Stability AI's model allows users to generate images with text prompts. There was evidence that Getty's images were used to train Stability's model. Stability was also found to have infringed Getty's trademarks in some cases. The judge, Mrs Justice Joanna Smith, said the question of where to strike the balance between the interests of the creative industries on one side and the AI industry on the other was "of very real societal importance".




Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models

Cheng, Min, Doudi, Fatemeh, Kalathil, Dileep, Ghavamzadeh, Mohammad, Kumar, Panganamala R.

arXiv.org Artificial Intelligence

Reinforcement learning (RL) algorithms have been used recently to align diffusion models with downstream objectives such as aesthetic quality and text-image consistency by fine-tuning them to maximize a single reward function under a fixed KL regularization. However, this approach is inherently restrictive in practice, where alignment must balance multiple, often conflicting objectives. Moreover, user preferences vary across prompts, individuals, and deployment contexts, with varying tolerances for deviation from a pre-trained base model. We address the problem of inference-time multi-preference alignment: given a set of basis reward functions and a reference KL regularization strength, can we design a fine-tuning procedure so that, at inference time, it can generate images aligned with any user-specified linear combination of rewards and regularization, without requiring additional fine-tuning? We propose Diffusion Blend, a novel approach to solve inference-time multi-preference alignment by blending backward diffusion processes associated with fine-tuned models, and we instantiate this approach with two algorithms: DB-MPA for multi-reward alignment and DB-KLA for KL regularization control. Extensive experiments show that Diffusion Blend algorithms consistently outperform relevant baselines and closely match or exceed the performance of individually fine-tuned models, enabling efficient, user-driven alignment at inference time. The code is available at https://github.com/bluewoods127/DB-2025.
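The core inference-time operation described above, combining the backward processes of per-reward fine-tuned models according to user-specified preference weights, can be sketched as a weighted combination of the models' noise predictions at each denoising step. This is an illustrative simplification under the assumption that blending acts on the noise predictions; the paper's exact blending rule may differ:

```python
import numpy as np

def blended_eps(eps_predictions, weights):
    """Blend per-reward noise predictions at one denoising step.

    eps_predictions: list of arrays, one noise prediction per
                     fine-tuned model (same shape each)
    weights:         user-specified preference weights, one per model
    """
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalize to a convex combination
    return sum(w * eps for w, eps in zip(weights, eps_predictions))
```

Because the weights enter only at inference time, a user can re-balance the reward trade-off per prompt without any additional fine-tuning, which is the practical point of the approach.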


Responsible Diffusion Models via Constraining Text Embeddings within Safe Regions

Li, Zhiwen, Chen, Die, Fan, Mingyuan, Chen, Cen, Li, Yaliang, Wang, Yanhao, Zhou, Wenmeng

arXiv.org Artificial Intelligence

The remarkable ability of diffusion models to generate high-fidelity images has led to their widespread adoption. However, concerns have also arisen regarding their potential to produce Not Safe for Work (NSFW) content and exhibit social biases, hindering their practical use in real-world applications. In response to this challenge, prior work has focused on employing security filters to identify and exclude toxic text, or alternatively, fine-tuning pre-trained diffusion models to erase sensitive concepts. Unfortunately, existing methods struggle to achieve satisfactory performance in the sense that they can have a significant impact on the normal model output while still failing to prevent the generation of harmful content in some cases. In this paper, we propose a novel self-discovery approach to identifying a semantic direction vector in the embedding space to restrict text embedding within a safe region. Our method circumvents the need for correcting individual words within the input text and steers the entire text prompt towards a safe region in the embedding space, thereby enhancing model robustness against all possibly unsafe prompts. In addition, we employ Low-Rank Adaptation (LoRA) for semantic direction vector initialization to reduce the impact on the model performance for other semantics. Furthermore, our method can also be integrated with existing methods to improve their social responsibility. Extensive experiments on benchmark datasets demonstrate that our method can effectively reduce NSFW content and mitigate social bias generated by diffusion models compared to several state-of-the-art baselines.
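The steering step described above, shifting an entire prompt embedding toward a safe region rather than editing individual words, can be sketched as adding a learned semantic direction vector to the text embedding. This is a hypothetical sketch: the function name and the additive update are assumptions, and in the actual method the direction is found via self-discovery and initialized with LoRA rather than given directly:

```python
import numpy as np

def steer_to_safe_region(text_embedding, safe_direction, strength=1.0):
    """Shift a prompt embedding along a learned 'safe' semantic direction.

    text_embedding: shape (d,), the full prompt embedding
    safe_direction: shape (d,), learned direction toward the safe region
    strength:       how far to push along the (unit-normalized) direction
    """
    d = safe_direction / np.linalg.norm(safe_direction)
    # Apply the same shift to the whole prompt embedding, so no
    # individual word needs to be identified or corrected.
    return text_embedding + strength * d
```

Operating on the whole embedding is what gives robustness against unsafe prompts the filter designers never anticipated, since the constraint does not depend on recognizing specific toxic tokens.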