AITopics | part segmentation

Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation

Neural Information Processing SystemsJun-21-2026, 07:20:18 GMT

We present Stable Part Diffusion 4D (SP4D), a framework for generating paired RGB and kinematic part videos from monocular inputs. Unlike conventional part segmentation methods that rely on appearance-based semantic cues, SP4D learns to produce kinematic parts -- structural components aligned with object articulation and consistent across views and time. SP4D adopts a dual-branch diffusion model that jointly synthesizes RGB frames and corresponding part segmentation maps. To simplify architecture and flexibly enable different part counts, we introduce a spatial color encoding scheme that maps part masks to continuous RGB-like images. This encoding allows the segmentation branch to share the latent VAE from the RGB branch, while enabling part segmentation to be recovered via straightforward post-processing.

machine learning, natural language, segmentation, (18 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

PartNeXt: ANext-Generation Dataset for Fine-Grained and Hierarchical 3DPart Understanding

Neural Information Processing SystemsJun-17-2026, 10:57:11 GMT

Understanding objects at the level of their constituent parts is fundamental to advancing computer vision, graphics, and robotics. While datasets like PartNet have driven progress in 3D part understanding, their reliance on untextured geometries and expert-dependent annotation limits scalability and usability. We introduce PartNeXt, a next-generation dataset addressing these gaps with over 23,000 highquality, textured 3D models annotated with fine-grained, hierarchical part labels across 50 categories. We benchmark PartNeXt on two tasks: (1) class-agnostic part segmentation, where state-of-the-art methods (e.g., PartField, SAMPart3D) struggle with fine-grained and leaf-level parts, and (2) 3D part-centric question answering, a new benchmark for 3D-LLMs that reveals significant gaps in open-vocabulary part grounding. Additionally, training Point-SAM on PartNeXt yields substantial gains over PartNet, underscoring the dataset's superior quality and diversity.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)

Add feedback

SegGraph: Leveraging Graphs of SAMSegments for Few-Shot 3DPart Segmentation

Neural Information Processing SystemsJun-14-2026, 23:00:14 GMT

This work presents a novel framework for few-shot 3D part segmentation. Recent advances have demonstrated the significant potential of 2D foundation models for low-shot 3D part segmentation. However, it is still an open problem that how to effectively aggregate 2D knowledge from foundation models to 3D. Existing methods either ignore geometric structures for 3D feature learning or neglects the high-quality grouping clues from SAM, leading to under-segmentation and inconsistent part labels. We devise a novel SAM segment graph-based propagation method, named SegGraph, to explicitly learn geometric features encoded within SAM's segmentation masks.

machine learning, natural language, segmentation, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (1.00)
Europe (1.00)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Robots (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

SegGraph: Leveraging Graphs of SAM Segments for Few-Shot 3D Part Segmentation

Neural Information Processing SystemsJun-10-2026, 12:21:51 GMT

This work presents a novel framework for few-shot 3D part segmentation. Recent advances have demonstrated the significant potential of 2D foundation models for low-shot 3D part segmentation. However, it is still an open problem that how to effectively aggregate 2D knowledge from foundation models to 3D. Existing methods either ignore geometric structures for 3D feature learning or neglects the high-quality grouping clues from SAM, leading to under-segmentation and inconsistent part labels. We devise a novel SAM segment graph-based propagation method, named SegGraph, to explicitly learn geometric features encoded within SAM's segmentation masks.

artificial intelligence, machine learning, proceedings, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.56)

Add feedback

OV-PARTS: Towards Open-Vocabulary Part Segmentation (Supplementary Material) Coauthor Affiliation Address email

Neural Information Processing SystemsApr-30-2026, 00:36:49 GMT

The supplementary material is organized as follows:1 Implementation Details.(Sec. Except for the Object Mask Prompt and Compositional Prompt Tuning designs,7 we follow the default architecture in the original ZSseg paper. The number of part queries is set to 50.8 All the two-stage baselines are trained with AdamW optimizer with the initial learning rate of 1e-49 and weight decay of 1e-4. A poly learning rate policy with a power of 0.9is adopted.

artificial intelligence, machine learning, pascal-part-116, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

OV-PARTS: Towards Open-Vocabulary Part Segmentation

Neural Information Processing SystemsApr-30-2026, 00:36:46 GMT

Furthermore, the large-scale vision and language models, which play a key role in the open vocabulary setting, struggle to recognize parts as effectively as objects. To comprehensively investigate and tackle these challenges, we propose an Open-Vocabulary Part Segmentation (OV-PARTS) benchmark. OV-PARTS includes refined versions of two publicly available datasets: Pascal-Part-116 and ADE20K-Part-234.

machine learning, natural language, segmentation, (13 more...)

Neural Information Processing Systems

Country: Asia (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

Neural Information Processing SystemsMar-22-2026, 21:46:09 GMT

Open-vocabulary part segmentation (OVPS) is an emerging research area focused on segmenting fine-grained entities using diverse and previously unseen vocabularies.Our study highlights the inherent complexities of part segmentation due to intricate boundaries and diverse granularity, reflecting the knowledge-based nature of part identification.To address these challenges, we propose PartCLIPSeg, a novel framework utilizing generalized parts and object-level contexts to mitigate the lack of generalization in fine-grained parts.PartCLIPSeg integrates competitive part relationships and attention control, alleviating ambiguous boundaries and underrepresented parts.Experimental results demonstrate that PartCLIPSeg outperforms existing state-of-the-art OVPS methods, offering refined segmentation and an advanced understanding of part relationships within images.Through extensive experiments, our model demonstrated a significant improvement over the state-of-the-art models on the Pascal-Part-116, ADE20K-Part-234, and PartImageNet datasets.

artificial intelligence, name change, proceedings, (4 more...)

Neural Information Processing Systems

Genre: Research Report (0.61)

Technology: Information Technology > Artificial Intelligence (0.41)

Add feedback

f7f47a73d631c0410cbc2748a8015241-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 18:24:13 GMT

machine learning, natural language, segmentation, (19 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Industry: Transportation > Ground > Road (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Add feedback

f4f094d1c87437fb9157a9eae5a3def0-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsFeb-18-2026, 17:13:03 GMT

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.94)

Add feedback

Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis

Neural Information Processing SystemsFeb-18-2026, 08:23:45 GMT

Articulated objects, composed of multiple rigid parts connected by joints allowing rotational or translational motion, such as doors, cupboards and spectacles are ubiquitous in our daily lives.

artificial intelligence, machine learning, object-oriented architecture, (13 more...)

Neural Information Processing Systems

Country: Asia (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.64)

Add feedback

Filters

Collaborating Authors

part segmentation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation

PartNeXt: ANext-Generation Dataset for Fine-Grained and Hierarchical 3DPart Understanding

SegGraph: Leveraging Graphs of SAMSegments for Few-Shot 3DPart Segmentation

SegGraph: Leveraging Graphs of SAM Segments for Few-Shot 3D Part Segmentation

OV-PARTS: Towards Open-Vocabulary Part Segmentation (Supplementary Material) Coauthor Affiliation Address email

OV-PARTS: Towards Open-Vocabulary Part Segmentation

Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

f7f47a73d631c0410cbc2748a8015241-Paper-Conference.pdf

f4f094d1c87437fb9157a9eae5a3def0-Paper-Datasets_and_Benchmarks_Track.pdf

Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis