AITopics | high-quality image

In this work, we investigate the task of text-to-image (T2I) synthesis under the abstract-to-intricate setting, i.e., generating intricate visual content from simple

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: Asia > Singapore (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
(2 more...)

Add feedback

29e8437db7b549160ce03d336ff66f65-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 08:16:20 GMT

algebraic error, dataset, qualitative result, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Neural Information Processing SystemsOct-10-2025, 16:31:02 GMT

Training models for ultra-high-resolution image generation presents significant challenges.

guidance, resolution, ultrapixel, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Guangzhou (0.04)
South America > Argentina (0.04)
North America > United States > Idaho (0.04)
(4 more...)

Genre: Research Report > Experimental Study (0.93)

Industry: Leisure & Entertainment (0.92)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

29e8437db7b549160ce03d336ff66f65-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 08:18:55 GMT

algebraic error, dataset, qualitative result, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Generation of Indian Sign Language Letters, Numbers, and Words

Yadav, Ajeet Kumar, Kumar, Nishant, N, Rathna G

arXiv.org Artificial IntelligenceAug-14-2025

Sign language, which contains hand movements, facial expressions and bodily gestures, is a significant medium for communicating with hard-of-hearing people. A well-trained sign language community communicates easily, but those who don't know sign language face significant challenges. Recognition and generation are basic communication methods between hearing and hard-of-hearing individuals. Despite progress in recognition, sign language generation still needs to be explored. The Progressive Growing of Generative Adversarial Network (ProGAN) excels at producing high-quality images, while the Self-Attention Generative Adversarial Network (SAGAN) generates feature-rich images at medium resolutions. Balancing resolution and detail is crucial for sign language image generation. We are developing a Generative Adversarial Network (GAN) variant that combines both models to generate feature-rich, high-resolution, and class-conditional sign language images. Our modified Attention-based model generates high-quality images of Indian Sign Language letters, numbers, and words, outperforming the traditional ProGAN in Inception Score (IS) and Fréchet Inception Distance (FID), with improvements of 3.2 and 30.12, respectively. Additionally, we are publishing a large dataset incorporating high-quality images of Indian Sign Language alphabets, numbers, and 129 words.

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/IACIS61494.2024.10721847

2508.09522

Country: Asia > India (0.15)

Genre: Research Report (0.40)

Industry: Education > Curriculum > Subject-Specific Education (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Restoring Real-World Images with an Internal Detail Enhancement Diffusion Model

Xiao, Peng, Zhao, Hongbo, Wang, Yijun, Lin, Jianxin

arXiv.org Artificial IntelligenceMay-28-2025

Restoring real-world degraded images, such as old photographs or low-resolution images, presents a significant challenge due to the complex, mixed degradations they exhibit, such as scratches, color fading, and noise. Recent data-driven approaches have struggled with two main challenges: achieving high-fidelity restoration and providing object-level control over colorization. While diffusion models have shown promise in generating high-quality images with specific controls, they often fail to fully preserve image details during restoration. In this work, we propose an internal detail-preserving diffusion model for high-fidelity restoration of real-world degraded images. Our method utilizes a pre-trained Stable Diffusion model as a generative prior, eliminating the need to train a model from scratch. Central to our approach is the Internal Image Detail Enhancement (IIDE) technique, which directs the diffusion model to preserve essential structural and textural information while mitigating degradation effects. The process starts by mapping the input image into a latent space, where we inject the diffusion denoising process with degradation operations that simulate the effects of various degradation factors. Extensive experiments demonstrate that our method significantly outperforms state-of-the-art models in both qualitative assessments and perceptual quantitative evaluations. Additionally, our approach supports text-guided restoration, enabling object-level colorization control that mimics the expertise of professional photo editing.

artificial intelligence, diffusion model, machine learning, (11 more...)

arXiv.org Artificial Intelligence

2505.18674

Genre:

Research Report > Promising Solution (0.67)
Research Report > New Finding (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility

Gai, Keke, Shen, Ziyue, Yu, Jing, Zhu, Liehuang, Wu, Qi

arXiv.org Artificial IntelligenceApr-17-2025

With the growing demand for protecting the intellectual property (IP) of text-to-image diffusion models, we propose PCDiff -- a proactive access control framework that redefines model authorization by regulating generation quality. At its core, PCDIFF integrates a trainable fuser module and hierarchical authentication layers into the decoder architecture, ensuring that only users with valid encrypted credentials can generate high-fidelity images. In the absence of valid keys, the system deliberately degrades output quality, effectively preventing unauthorized exploitation.Importantly, while the primary mechanism enforces active access control through architectural intervention, its decoupled design retains compatibility with existing watermarking techniques. This satisfies the need of model owners to actively control model ownership while preserving the traceability capabilities provided by traditional watermarking approaches.Extensive experimental evaluations confirm a strong dependency between credential verification and image quality across various attack scenarios. Moreover, when combined with typical post-processing operations, PCDIFF demonstrates powerful performance alongside conventional watermarking methods. This work shifts the paradigm from passive detection to proactive enforcement of authorization, laying the groundwork for IP management of diffusion models.

artificial intelligence, diffusion model, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2504.11774

Country: Asia > China (0.15)

Genre: Research Report (0.84)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Filters

Collaborating Authors

high-quality image

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

fa64505ebdc94531087bc81251ce2376-Paper-Conference.pdf

c9028f7874df04843e7bf435ee4cd3c3-Paper-Conference.pdf

fa64505ebdc94531087bc81251ce2376-Supplemental-Conference.pdf

Shengqiong Wu1 Hao Fei

29e8437db7b549160ce03d336ff66f65-Supplemental-Conference.pdf

UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

29e8437db7b549160ce03d336ff66f65-Supplemental-Conference.pdf

Generation of Indian Sign Language Letters, Numbers, and Words

Restoring Real-World Images with an Internal Detail Enhancement Diffusion Model

PCDiff: Proactive Control for Ownership Protection in Diffusion Models with Watermark Compatibility