Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers

Neural Information Processing Systems

Recent advancements in 3D Large Language Models (LLMs) have demonstrated promising capabilities for 3D scene understanding.