Enhancing Feature Diversity Boosts Channel-Adaptive Vision Transformers
Neural Information Processing Systems
Multi-Channel Imaging (MCI) poses an array of challenges for encoding useful feature representations that do not arise with traditional images. For example, images from two different satellites may both contain RGB channels, but the remaining channels can differ between imaging sources. Thus, MCI models must support a variety of channel configurations at test time. Recent work has extended traditional visual encoders, such as Vision Transformers (ViT), to MCI by supplementing pixel information with an encoding that represents the channel configuration. However, these methods treat each channel equally, i.e., they do not consider the unique properties of each channel type, which can result in needless and potentially harmful redundancies in the learned features.
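The idea of supplementing pixel information with a channel encoding can be illustrated with a minimal sketch. It is not the paper's method. It only assumes that each channel is split into patches, that all channels share one patch projection, and that a per-channel vector is added to every token from that channel. The names `SHARED_PROJ`, `CHANNEL_EMBED`, `patches`, and `tokenize`, the channel labels, and the sizes are hypothetical, and the weights are random rather than learned.

```python
import random

PATCH = 2  # patch side length (assumed for illustration)
DIM = 4    # token embedding width (assumed for illustration)

rng = random.Random(0)

def rand_vec(n):
    """Random stand-in for a learned weight vector."""
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]

# One projection shared by every channel: flattened patch -> DIM values.
SHARED_PROJ = [rand_vec(PATCH * PATCH) for _ in range(DIM)]

# One vector per known channel type (hypothetical channel names).
CHANNEL_EMBED = {name: rand_vec(DIM) for name in ("R", "G", "B", "NIR", "SWIR")}

def patches(plane):
    """Split one 2D channel plane into flattened, non-overlapping patches."""
    height, width = len(plane), len(plane[0])
    out = []
    for top in range(0, height - PATCH + 1, PATCH):
        for left in range(0, width - PATCH + 1, PATCH):
            out.append([plane[top + i][left + j]
                        for i in range(PATCH) for j in range(PATCH)])
    return out

def tokenize(image):
    """Map {channel name: 2D plane} to a list of (channel name, token) pairs.

    The token count follows the channels actually present, so the same
    function handles different channel configurations.
    """
    tokens = []
    for name, plane in image.items():
        channel_vec = CHANNEL_EMBED[name]
        for patch in patches(plane):
            projected = [sum(w * x for w, x in zip(row, patch))
                         for row in SHARED_PROJ]
            tokens.append((name, [p + c for p, c in zip(projected, channel_vec)]))
    return tokens

# Two sources that share RGB but differ in their remaining channels.
plane = [[0.1 * (r + c) for c in range(4)] for r in range(4)]
sat_a = {"R": plane, "G": plane, "B": plane, "NIR": plane}
sat_b = {"R": plane, "G": plane, "B": plane}
print(len(tokenize(sat_a)), len(tokenize(sat_b)))  # 16 12
```

In this sketch the channel vector is the only thing that distinguishes tokens from different channels, since the projection is shared. That is the sense in which such encoders treat each channel equally.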
Mar-26-2025, 12:09:34 GMT
- Country:
- Europe > Denmark (0.14)
- North America > United States (0.14)
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (0.67)
- Research Report
- Industry:
- Health & Medicine (0.46)
- Information Technology (0.46)
- Technology: