Goto

Collaborating Authors

 colorization


f3bfbd65743e60c685a3845bd61ce15f-Supplemental-Conference.pdf

Neural Information Processing Systems

L-CAD: Language-basedThe tricColorizationycle on the left is red, and the tricycle on the right is orange. We leverage a referring segmentation model to roughly estimate object contours mentioned in the ur description, which enables us to perform the instance-aware sampling strategy. Othe robustness of our model, we manually annotate a sequence of contours ranging from coarse to fine and visualize the corresponding colorization results. As shown in Figure 8, our model presents aG remarkable ability to produce condition-consistent colorization results even using imprecise contours. This is because the sampling is performed in the latent space using downsampled contours and the compression decoder in the pixel space could adaptively fix color bleeding issues.






Prompt-based Consistent Video Colorization

arXiv.org Artificial Intelligence

Existing video colorization methods struggle with temporal flickering or demand extensive manual input. We propose a novel approach automating high-fidelity video colorization using rich semantic guidance derived from language and segmentation. We employ a language-conditioned diffusion model to colorize grayscale frames. Guidance is provided via automatically generated object masks and textual prompts; our primary automatic method uses a generic prompt, achieving state-of-the-art results without specific color input. Temporal stability is achieved by warping color information from previous frames using optical flow (RAFT); a correction step detects and fixes inconsistencies introduced by warping. Evaluations on standard benchmarks (DAVIS30, VIDEVO20) show our method achieves state-of-the-art performance in colorization accuracy (PSNR) and visual realism (Colorfulness, CDC), demonstrating the efficacy of automated prompt-based guidance for consistent video colorization.


Uncolorable Examples: Preventing Unauthorized AI Colorization via Perception-Aware Chroma-Restrictive Perturbation

arXiv.org Artificial Intelligence

AI-based colorization has shown remarkable capability in generating realistic color images from grayscale inputs. However, it poses risks of copyright infringement -- for example, the unauthorized colorization and resale of monochrome manga and films. Despite these concerns, no effective method currently exists to prevent such misuse. To address this, we introduce the first defensive paradigm, Uncolorable Examples, which embed imperceptible perturbations into grayscale images to invalidate unauthorized colorization. To ensure real-world applicability, we establish four criteria: effectiveness, imperceptibility, transferability, and robustness. Our method, Perception-Aware Chroma-Restrictive Perturbation (PAChroma), generates Uncolorable Examples that meet these four criteria by optimizing imperceptible perturbations with a Laplacian filter to preserve perceptual quality, and applying diverse input transformations during optimization to enhance transferability across models and robustness against common post-processing (e.g., compression). Experiments on ImageNet and Danbooru datasets demonstrate that PAChroma effectively degrades colorization quality while maintaining the visual appearance. This work marks the first step toward protecting visual content from illegitimate AI colorization, paving the way for copyright-aware defenses in generative media.