VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
Neural Information Processing Systems
Specifically, we introduce Multiway Transformer, where each block contains a pool of modality-specific experts and a shared self-attention layer.
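The block structure described above can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the class name `MultiwayBlock`, the modality names `"vision"` and `"language"`, the single-head attention, the ReLU feed-forward experts, and the omission of layer normalization are all assumptions made to keep the example short. What it does show is the stated idea: one self-attention layer shared by every token, followed by a feed-forward expert picked per token according to its modality.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rng, rows, cols):
    """Small random weight matrix (illustrative initialization only)."""
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


class MultiwayBlock:
    """Toy block: shared self-attention plus a pool of per-modality experts."""

    def __init__(self, dim, hidden, modalities=("vision", "language"), seed=0):
        rng = random.Random(seed)
        self.dim = dim
        # Attention projections are shared by all modalities.
        self.wq, self.wk, self.wv = (rand_matrix(rng, dim, dim) for _ in range(3))
        # One two-layer feed-forward expert per modality (assumed names).
        self.experts = {
            m: (rand_matrix(rng, dim, hidden), rand_matrix(rng, hidden, dim))
            for m in modalities
        }

    def attention(self, x):
        """Single-head scaled dot-product self-attention over all tokens."""
        q, k, v = matmul(x, self.wq), matmul(x, self.wk), matmul(x, self.wv)
        k_t = [list(col) for col in zip(*k)]
        scale = 1.0 / math.sqrt(self.dim)
        weights = [softmax([s * scale for s in row]) for row in matmul(q, k_t)]
        return matmul(weights, v)

    def expert(self, token, modality):
        """Apply the feed-forward expert that belongs to the token's modality."""
        w1, w2 = self.experts[modality]
        hidden = [max(0.0, h) for h in matmul([token], w1)[0]]  # ReLU
        return matmul([hidden], w2)[0]

    def forward(self, x, modalities):
        """x: list of token vectors; modalities: one modality name per token."""
        attn = self.attention(x)
        x = [[a + b for a, b in zip(tok, att)] for tok, att in zip(x, attn)]
        out = []
        for token, modality in zip(x, modalities):
            ffn = self.expert(token, modality)
            out.append([a + b for a, b in zip(token, ffn)])  # residual
        return out
```

A short usage example: two image-patch tokens and one text token pass through the same attention layer, then each is routed to its own expert.

```python
block = MultiwayBlock(dim=4, hidden=8)
tokens = rand_matrix(random.Random(1), 3, 4)
outputs = block.forward(tokens, ["vision", "vision", "language"])
```

Because attention is shared, tokens from different modalities can attend to each other, while the per-modality experts keep modality-specific parameters separate.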
Aug-19-2025, 05:39:03 GMT
- Country:
  - Africa > Ethiopia
    - Addis Ababa > Addis Ababa (0.04)
  - Asia > China
    - Heilongjiang Province > Harbin (0.04)
    - Hong Kong (0.04)
  - Europe
  - North America
    - Canada > British Columbia
      - Vancouver (0.04)
    - United States
      - California > Los Angeles County
        - Long Beach (0.04)
      - Hawaii > Honolulu County
        - Honolulu (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Massachusetts > Suffolk County
        - Boston (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - Washington > King County
        - Seattle (0.04)
  - Oceania > Australia
  - South America > Chile
- Genre:
  - Research Report > New Finding (0.46)
- Technology: