XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang Y anbo Wang

Feb-16-2026, 10:31:48 GMT–Neural Information Processing Systems

Subsequently, the generated 2D masks are employed to align mask-level 3D representations with the vision-language feature space, thereby augmenting the open vocabulary capability of 3D geometry embeddings.

large language model, machine learning, segmentation, (21 more...)

Neural Information Processing Systems

Feb-16-2026, 10:31:48 GMT

Conferences PDF

Country:
- Asia > China (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Information Technology (0.67)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.93)
  - Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (0.93)
    - Natural Language > Large Language Model (0.68)
    - Machine Learning > Neural Networks
      - Deep Learning (0.46)

Duplicate Docs Excel Report

Title
88f00376aedb947af123a7868fce3e58-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found