OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling
Neural Information Processing Systems
In this paper, we propose OneRef, a minimalist referring framework built on the modality-shared one-tower transformer that unifies the visual and linguistic feature spaces.
Oct-10-2025, 22:21:42 GMT
- Country:
- Asia
- Europe > Netherlands
- North Holland > Amsterdam (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Industry:
- Information Technology (0.46)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language
- Chatbot (0.68)
- Large Language Model (0.93)
- Text Processing (0.93)
- Vision (1.00)
- Communications (1.00)
- Sensing and Signal Processing > Image Processing (1.00)