AITopics | indoor scene

Collaborating Authors

indoor scene

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

953d276d037e701fcd97dbb34ebb2394-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 18:48:27 GMT

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.93)

Industry: Information Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)

Add feedback

SG-Nav: Online3DSceneGraphPromptingfor LLM-basedZero-shotObjectNavigation

Neural Information Processing SystemsFeb-7-2026, 15:26:53 GMT

In the zero-shot setting, the system does notrequire anytraining orfinetuning before applied toreal-worldscenarios, andthegoal category can be freely specified by text in an open-vocabulary manner.

artificial intelligence, large language model, natural language, (18 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)

Add feedback

SCRREAM : SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark

Neural Information Processing SystemsFeb-6-2026, 09:20:59 GMT

Traditionally, 3d indoor datasets have generally prioritized scale over ground-truth accuracy in order to obtain improved generalization. However, using these datasets to evaluate dense geometry tasks, such as depth rendering, can be problematic as the meshes of the dataset are often incomplete and may produce wrong ground truth to evaluate the details. In this paper, we propose SCRREAM, a dataset annotation framework that allows annotation of fully dense meshes of objects in the scene and registers camera poses on the real image sequence, which can produce accurate ground truth for both sparse 3D as well as dense 3D tasks. We show the details of the dataset annotation pipeline and showcase four possible variants of datasets that can be obtained from our framework with example scenes, such as indoor reconstruction and SLAM, scene editing \& object removal, human reconstruction and 6d pose estimation. Recent pipelines for indoor reconstruction and SLAM serve as new benchmarks. In contrast to previous indoor dataset, our design allows to evaluate dense geometry tasks on eleven sample scenes against accurately rendered ground truth depth maps.

artificial intelligence, indoor scene, render and map, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion

Neural Information Processing SystemsDec-25-2025, 14:46:35 GMT

Controllable scene synthesis aims to create interactive environments for numerous industrial use cases. Scene graphs provide a highly suitable interface to facilitate these applications by abstracting the scene context in a compact manner. Existing methods, reliant on retrieval from extensive databases or pre-trained shape embeddings, often overlook scene-object and object-object relationships, leading to inconsistent results due to their limited generation capacity. To address this issue, we present CommonScenes, a fully generative model that converts scene graphs into corresponding controllable 3D scenes, which are semantically realistic and conform to commonsense. Our pipeline consists of two branches, one predicting the overall scene layout via a variational auto-encoder and the other generating compatible shapes via latent diffusion, capturing global scene-object and local inter-object relationships in the scene graph while preserving shape diversity. The generated scenes can be manipulated by editing the input scene graph and sampling the noise in the diffusion model. Due to the lack of a scene graph dataset offering high-quality object-level meshes with relations, we also construct SG-FRONT, enriching the off-the-shelf indoor dataset 3D-FRONT with additional scene graph labels. Extensive experiments are conducted on SG-FRONT, where CommonScenes shows clear advantages over other methods regarding generation consistency, quality, and diversity. Codes and the dataset are available on the website.

commonscene, indoor scene, name change, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts

Zhong, Weipeng, Cao, Peizhou, Jin, Yichen, Luo, Li, Cai, Wenzhe, Lin, Jingli, Wang, Hanqing, Lyu, Zhaoyang, Wang, Tai, Dai, Bo, Xu, Xudong, Pang, Jiangmiao

arXiv.org Artificial IntelligenceOct-15-2025

The advancement of Embodied AI heavily relies on large-scale, simulatable 3D scene datasets characterized by scene diversity and realistic layouts. However, existing datasets typically suffer from limitations in data scale or diversity, sanitized layouts lacking small items, and severe object collisions. To address these shortcomings, we introduce \textbf{InternScenes}, a novel large-scale simulatable indoor scene dataset comprising approximately 40,000 diverse scenes by integrating three disparate scene sources, real-world scans, procedurally generated scenes, and designer-created scenes, including 1.96M 3D objects and covering 15 common scene types and 288 object classes. We particularly preserve massive small items in the scenes, resulting in realistic and complex layouts with an average of 41.5 objects per region. Our comprehensive data processing pipeline ensures simulatability by creating real-to-sim replicas for real-world scans, enhances interactivity by incorporating interactive objects into these scenes, and resolves object collisions by physical simulations. We demonstrate the value of InternScenes with two benchmark applications: scene layout generation and point-goal navigation. Both show the new challenges posed by the complex and realistic layouts. More importantly, InternScenes paves the way for scaling up the model training for both tasks, making the generation and navigation in such complex scenes possible. We commit to open-sourcing the data, models, and benchmarks to benefit the whole community.

machine learning, natural language, object-oriented architecture, (20 more...)

arXiv.org Artificial Intelligence

2509.10813

Genre: Research Report (1.00)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.67)

Add feedback

SceneCraft: Layout-Guided 3D Scene Generation Xiuyu Y ang 1 Y unze Man 2 Jun-Kun Chen 2 Y u-Xiong Wang

Neural Information Processing SystemsOct-10-2025, 10:19:55 GMT

Although some pioneering methods have achieved automatic text-to-3D generation, they are generally limited to small-scale scenes with restricted control over the shape and texture.

layout, representation, scenecraft, (16 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)
Research Report > Promising Solution (0.66)

Industry: Information Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)

Add feedback

Supplementary Material CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion

Neural Information Processing SystemsOct-8-2025, 18:59:24 GMT

We provide the outcomes of this study in Figure 3 and Figure 4.

artificial intelligence, graph-to-3d, machine learning, (16 more...)

Neural Information Processing Systems

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.68)

Add feedback

Targeted Adversarial Perturbations for Monocular Depth Prediction SUPPLEMENTARY MATERIALS

Neural Information Processing SystemsOct-3-2025, 01:22:58 GMT

In Sec. 2, the robustness of perturbations against defenses is discussed.

artificial intelligence, machine learning, perturbation, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures

Han, Seungoh, Jang, Jaehoon, Kim, Hyunsu, Surh, Jaeheung, Kwak, Junhyung, Ha, Hyowon, Joo, Kyungdon

arXiv.org Artificial IntelligenceJul-9-2025

Recent advances in 3D Gaussian Splatting (3DGS) have enabled real-time novel view synthesis (NVS) with impressive quality in indoor scenes. However, achieving high-fidelity rendering requires meticulously captured images covering the entire scene, limiting accessibility for general users. We aim to develop a practical 3DGS-based NVS framework using simple panorama-style motion with a handheld camera (e.g., mobile device). While convenient, this rotation-dominant motion and narrow baseline make accurate camera pose and 3D point estimation challenging, especially in textureless indoor scenes. To address these challenges, we propose LighthouseGS, a novel framework inspired by the lighthouse-like sweeping motion of panoramic views. LighthouseGS leverages rough geometric priors, such as mobile device camera poses and monocular depth estimation, and utilizes the planar structures often found in indoor environments. We present a new initialization method called plane scaffold assembly to generate consistent 3D points on these structures, followed by a stable pruning strategy to enhance geometry and optimization stability. Additionally, we introduce geometric and photometric corrections to resolve inconsistencies from motion drift and auto-exposure in mobile devices. Tested on collected real and synthetic indoor scenes, LighthouseGS delivers photorealistic rendering, surpassing state-of-the-art methods and demonstrating the potential for panoramic view synthesis and object placement.

artificial intelligence, gaussian, indoor scene, (15 more...)

arXiv.org Artificial Intelligence

2507.06109

Country: Asia > South Korea (0.29)

Genre: Research Report > Promising Solution (0.34)

Industry: Media > Photography (0.68)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)

Add feedback

Filters

Collaborating Authors

indoor scene

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

953d276d037e701fcd97dbb34ebb2394-Paper-Conference.pdf

SG-Nav: Online3DSceneGraphPromptingfor LLM-basedZero-shotObjectNavigation

SCRREAM : SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark

CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion

InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts

SceneCraft: Layout-Guided 3D Scene Generation Xiuyu Y ang 1 Y unze Man 2 Jun-Kun Chen 2 Y u-Xiong Wang

Supplementary Material CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion

Targeted Adversarial Perturbations for Monocular Depth Prediction SUPPLEMENTARY MATERIALS

6030db5195150ac86d942186f4abdad8-Paper-Conference.pdf

LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures