AITopics | sun rgb-d dataset

Collaborating Authors

sun rgb-d dataset

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Uni3DETR: Unified 3D Detection Transformer

Neural Information Processing SystemsFeb-15-2026, 11:29:17 GMT

Existing point cloud based 3D detectors are designed for the particular scene, either indoor or outdoor ones.

artificial intelligence, detection, machine learning, (17 more...)

Neural Information Processing Systems

Country:

South America > Brazil (0.04)
Asia > China > Hong Kong (0.04)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Vision (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

370fa2e691f57eb319bc263a07dad4a5-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 04:04:19 GMT

category, detection performance, sun rgb-d dataset, (10 more...)

Neural Information Processing Systems

Country: North America > United States > California > Santa Clara County > Palo Alto (0.05)

Technology: Information Technology > Artificial Intelligence > Vision (1.00)

Add feedback

370fa2e691f57eb319bc263a07dad4a5-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 04:04:16 GMT

dataset, detection, insertion, (13 more...)

Neural Information Processing Systems

Country:

South America > Brazil (0.04)
Asia > Middle East > Israel (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Uni3DETR: Unified 3D Detection Transformer

Neural Information Processing SystemsOct-8-2025, 23:25:56 GMT

Existing point cloud based 3D detectors are designed for the particular scene, either indoor or outdoor ones.

artificial intelligence, detection, machine learning, (17 more...)

Neural Information Processing Systems

Country:

South America > Brazil (0.04)
Asia > China > Hong Kong (0.04)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Vision (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

370fa2e691f57eb319bc263a07dad4a5-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 11:02:52 GMT

category, detection performance, sun rgb-d dataset, (10 more...)

Neural Information Processing Systems

Country: North America > United States > California > Santa Clara County > Palo Alto (0.05)

Technology: Information Technology > Artificial Intelligence > Vision (1.00)

Add feedback

370fa2e691f57eb319bc263a07dad4a5-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 11:02:49 GMT

dataset, detection, insertion, (13 more...)

Neural Information Processing Systems

Country:

South America > Brazil (0.04)
Asia > Middle East > Israel (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection

Ge, Yunhao, Yu, Hong-Xing, Zhao, Cheng, Guo, Yuliang, Huang, Xinyu, Ren, Liu, Itti, Laurent, Wu, Jiajun

arXiv.org Artificial IntelligenceDec-8-2023

A major challenge in monocular 3D object detection is the limited diversity and quantity of objects in real datasets. While augmenting real scenes with virtual objects holds promise to improve both the diversity and quantity of the objects, it remains elusive due to the lack of an effective 3D object insertion method in complex real captured scenes. In this work, we study augmenting complex real indoor scenes with virtual objects for monocular 3D object detection. The main challenge is to automatically identify plausible physical properties for virtual assets (e.g., locations, appearances, sizes, etc.) in cluttered real scenes. To address this challenge, we propose a physically plausible indoor 3D object insertion approach to automatically copy virtual objects and paste them into real scenes. The resulting objects in scenes have 3D bounding boxes with plausible physical locations and appearances. In particular, our method first identifies physically feasible locations and poses for the inserted objects to prevent collisions with the existing room layout. Subsequently, it estimates spatially-varying illumination for the insertion location, enabling the immersive blending of the virtual objects into the original scene with plausible appearances and cast shadows. We show that our augmentation method significantly improves existing monocular 3D object models and achieves state-of-the-art performance. For the first time, we demonstrate that a physically plausible 3D object insertion, serving as a generative data augmentation technique, can lead to significant improvements for discriminative downstream tasks such as monocular 3D object detection. Project website: https://gyhandy.github.io/3D-Copy-Paste/

dataset, detection, insertion, (13 more...)

arXiv.org Artificial Intelligence

2312.05277

Country:

South America > Brazil (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Centroid Based Concept Learning for RGB-D Indoor Scene Classification

Ayub, Ali, Wagner, Alan R.

arXiv.org Artificial IntelligenceAug-14-2020

Classifying images taken from indoor scenes is an important area of research. The development of an accurate indoor scene classifier has the potential to improve indoor localization and decision-making for domestic robots, offer new applications for wearable computer users, and generally result in better vision-based situation awareness thus impacting a wide variety of applications. The introduction of deep learning methods, the creation of numerous large-scale datasets, and the development of specialized computing hardware have all contributed to the rapid improvement in image classification performance. One reason for deep learning's success has been the ability to learn multiple layers of generic image features that can then be used on other related computer vision problems. For instance, features from object trained image classifiers have been used to train indoor scene classifiers [27]. Yet, indoor scene classification is a challenging problem on its own.

artificial intelligence, category, machine learning, (17 more...)

arXiv.org Artificial Intelligence

1911.00155

Country: North America > United States > Pennsylvania > Centre County > State College (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DF 2 Net: Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification

Li, Yabei (Institute of Automation, Chinese Academy of Sciences (CASIA)) | Zhang, Junge (Institute of Automation, Chinese Academy of Sciences (CASIA)) | Cheng, Yanhua (Tencent) | Huang, Kaiqi (Institute of Automation, Chinese Academy of Sciences (CASIA)) | Tan, Tieniu (Institute of Automation, Chinese Academy of Sciences (CASIA))

AAAI ConferencesFeb-8-2018

This paper focuses on the task of RGB-D indoor scene classification. It is a very challenging task due to two folds. 1) Learning robust representation for indoor scene is difficult because of various objects and layouts. 2) Fusing the complementary cues in RGB and Depth is nontrivial since there are large semantic gaps between the two modalities. Most existing works learn representation for classification by training a deep network with softmax loss and fuse the two modalities by simply concatenating the features of them. However, these pipelines do not explicitly consider intra-class and inter-class similarity as well as inter-modal intrinsic relationships. To address these problems, this paper proposes a Discriminative Feature Learning and Fusion Network (DF 2 Net) with two-stage training. In the first stage, to better represent scene in each modality, a deep multi-task network is constructed to simultaneously minimize the structured loss and the softmax loss. In the second stage, we design a novel discriminative fusion network which is able to learn correlative features of multiple modalities and distinctive features of each modality. Extensive analysis and experiments on SUN RGB-D Dataset and NYU Depth Dataset V2 show the superiority of DF 2 Net over other state-of-the-art methods in RGB-D indoor scene classification task.

artificial intelligence, machine learning, representation, (14 more...)

AAAI Conferences

Thirty-Second AAAI Conference on Artificial Intelligence

Genre: Research Report (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback