AITopics

Industry: Leisure & Entertainment > Sports (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (0.49)
Information Technology > Artificial Intelligence > Machine Learning (0.47)

Neural Information Processing SystemsFeb-17-2026, 23:01:50 GMT

f3bfbd65743e60c685a3845bd61ce15f-Supplemental-Conference.pdf

artificial intelligence, colorization result, machine learning, (16 more...)

Country:

Europe > Switzerland > Zürich > Zürich (0.05)
Asia > China > Beijing > Beijing (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.49)
Information Technology > Artificial Intelligence > Vision (0.31)

Neural Information Processing SystemsFeb-17-2026, 23:01:46 GMT

L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors

With the proposed novel sampling strategy, our model achieves instance-aware colorization in diverse and complex scenarios.

colorization, machine learning, natural language, (20 more...)

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Neural Information Processing SystemsOct-9-2025, 11:46:15 GMT

f3bfbd65743e60c685a3845bd61ce15f-Paper-Conference.pdf

colorization, machine learning, natural language, (20 more...)

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Mourchid, Youssef, Donias, Marc, Berthoumieu, Yannick, Najim, Mohamed

SPDGAN: A Generative Adversarial Network based on SPD Manifold Learning for Automatic Image Colorization

arXiv.org Artificial IntelligenceDec-20-2023

This paper addresses the automatic colorization problem, which converts a gray-scale image to a colorized one. Recent deep-learning approaches can colorize automatically grayscale images. However, when it comes to different scenes which contain distinct color styles, it is difficult to accurately capture the color characteristics. In this work, we propose a fully automatic colorization approach based on Symmetric Positive Definite (SPD) Manifold Learning with a generative adversarial network (SPDGAN) that improves the quality of the colorization results. Our SPDGAN model establishes an adversarial game between two discriminators and a generator. The latter is based on ResNet architecture with few alterations. Its goal is to generate fake colorized images without losing color information across layers through residual connections. Then, we employ two discriminators from different domains. The first one is devoted to the image pixel domain, while the second one is to the Riemann manifold domain which helps to avoid color misalignment. Extensive experiments are conducted on the Places365 and COCO-stuff databases to test the effect of each component of our SPDGAN. In addition, quantitative and qualitative comparisons with state-of-the-art methods demonstrate the effectiveness of our model by achieving more realistic colorized images with less artifacts visually, and good results of PSNR, SSIM, and FID values.

generative adversarial network, spd manifold learning, spdgan, (12 more...)

doi: 10.1007/s00521-023-08999-8

2312.13506

Country:

Europe > France (0.04)
Asia (0.04)

Genre: Research Report > Promising Solution (0.49)

Industry: Education (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceOct-23-2023

L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors

Chang, Zheng, Weng, Shuchen, Zhang, Peixuan, Li, Yu, Li, Si, Shi, Boxin

Language-based colorization produces plausible and visually pleasing colors under the guidance of user-friendly natural language descriptions. Previous methods implicitly assume that users provide comprehensive color descriptions for most of the objects in the image, which leads to suboptimal performance. In this paper, we propose a unified model to perform language-based colorization with any-level descriptions. We leverage the pretrained cross-modality generative model for its robust language understanding and rich color priors to handle the inherent ambiguity of any-level descriptions. We further design modules to align with input conditions to preserve local spatial structures and prevent the ghosting effect. With the proposed novel sampling strategy, our model achieves instance-aware colorization in diverse and complex scenarios. Extensive experimental results demonstrate our advantages of effectively handling any-level descriptions and outperforming both language-based and automatic colorization methods. The code and pretrained models are available at: https://github.com/changzheng123/L-CAD.

colorization, colorization method, colorization result, (16 more...)

2305.15217

Country:

North America > United States > District of Columbia > Washington (0.04)
North America > United States > California > Placer County (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Leisure & Entertainment > Sports > Tennis (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

arXiv.org Artificial IntelligenceJun-2-2023

Video Colorization with Pre-trained Text-to-Image Diffusion Models

Liu, Hanyuan, Xie, Minshan, Xing, Jinbo, Li, Chengze, Wong, Tien-Tsin

Video colorization is a challenging task that involves inferring plausible and temporally consistent colors for grayscale frames. In this paper, we present ColorDiffuser, an adaptation of a pre-trained text-to-image latent diffusion model for video colorization. With the proposed adapter-based approach, we repropose the pre-trained text-to-image model to accept input grayscale video frames, with the optional text description, for video colorization. To enhance the temporal coherence and maintain the vividness of colorization across frames, we propose two novel techniques: the Color Propagation Attention and Alternated Sampling Strategy. Color Propagation Attention enables the model to refine its colorization decision based on a reference latent frame, while Alternated Sampling Strategy captures spatiotemporal dependencies by using the next and previous adjacent latent frames alternatively as reference during the generative diffusion sampling steps. This encourages bidirectional color information propagation between adjacent video frames, leading to improved color consistency across frames. We conduct extensive experiments on benchmark datasets, and the results demonstrate the effectiveness of our proposed framework. The evaluations show that ColorDiffuser achieves state-of-the-art performance in video colorization, surpassing existing methods in terms of color fidelity, temporal consistency, and visual quality.

artificial intelligence, colorization, machine learning, (15 more...)

2306.01732

Country: Asia > China > Hong Kong (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

arXiv.org Artificial IntelligenceJul-1-2017

Unsupervised Diverse Colorization via Generative Adversarial Networks

Cao, Yun, Zhou, Zhiming, Zhang, Weinan, Yu, Yong

Colorization of grayscale images has been a hot topic in computer vision. Previous research mainly focuses on producing a colored image to match the original one. However, since many colors share the same gray value, an input grayscale image could be diversely colored while maintaining its reality. In this paper, we design a novel solution for unsupervised diverse colorization. Specifically, we leverage conditional generative adversarial networks to model the distribution of real-world item colors, in which we develop a fully convolutional generator with multi-layer noise to enhance diversity, with multi-layer condition concatenation to maintain reality, and with stride 1 to keep spatial information. With such a novel network architecture, the model yields highly competitive performance on the open LSUN bedroom dataset. The Turing test of 80 humans further indicates our generated color schemes are highly convincible.

artificial intelligence, colorization, machine learning, (17 more...)

1702.06674

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > China > Shanghai > Shanghai (0.04)
(13 more...)

Genre: Research Report > Experimental Study (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)