Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model
Neural Information Processing Systems
Existing multi-modal image fusion methods fail to address the compound degradations present in source images, producing fused images plagued by noise, color bias, improper exposure, and similar defects. These methods also often overlook the specificity of foreground objects, which weakens the salience of the objects of interest in the fused images. To address these challenges, this study proposes Text-DiFuse, a novel interactive multi-modal image fusion framework based on a text-modulated diffusion model.
May-29-2025, 08:49:00 GMT
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Health & Medicine > Diagnostic Medicine > Imaging (0.46)
- Technology: