AITopics | scene graph generation

Joint Modeling of Visual Objects and Relations for Scene Graph Generation (Supplementary Material)

Neural Information Processing Systems | Apr-25-2026, 14:22:48 GMT

Based on the formulation of the likelihood function pΘ(G|I) = fΘ(G,I)/ZΘ(I), we can reformulate the gradient of log-likelihood function as: ∇Θ L(Θ) = E_{G∼pd}[∇Θ log fΘ(G,I)] − ∇Θ log ZΘ(I). Theorem 2. In the initialization phase, the potential function ψtriplet(r,yoh,yot) for modeling label dependency is omitted in p(G|I), yielding a simplified model distribution ˆp(G|I). Now, we can exactly derive that q(G) = ˆp(G|I). Theorem 3. In the update phase, we use the full expression of p(G|I) with the potential function ψtriplet(r,yoh,yot) for modeling label dependency. In this case, maximizing L(q) is equivalent to minimizing the KL divergence term, and the minimum occurs when q(yo) = p(yo,I).
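
For reference, the gradient form quoted in this excerpt can be derived in a few steps. This is a sketch under two assumptions that the excerpt does not spell out: that ZΘ(I) is the sum of fΘ(G′,I) over all candidate scene graphs G′ (the usual normalizer), and that L(Θ) is the expected log-likelihood under the data distribution pd for a given image I.

```latex
\begin{align}
\log p_\Theta(G \mid I)
  &= \log f_\Theta(G, I) - \log Z_\Theta(I), \\
\nabla_\Theta L(\Theta)
  &= \mathbb{E}_{G \sim p_d}\big[\nabla_\Theta \log f_\Theta(G, I)\big]
     - \nabla_\Theta \log Z_\Theta(I), \\
\nabla_\Theta \log Z_\Theta(I)
  &= \frac{1}{Z_\Theta(I)} \sum_{G'} \nabla_\Theta f_\Theta(G', I)
   = \sum_{G'} p_\Theta(G' \mid I)\, \nabla_\Theta \log f_\Theta(G', I) \\
  &= \mathbb{E}_{G' \sim p_\Theta(\cdot \mid I)}\big[\nabla_\Theta \log f_\Theta(G', I)\big].
\end{align}
```

Under these assumptions the second term is an expectation under the model distribution, which typically cannot be summed exactly over all scene graphs and has to be approximated.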

artificial intelligence, const, triplet, (10 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec (0.15)
Asia > China (0.15)

Technology: Information Technology > Artificial Intelligence (0.49)

LinkNet: Relational Embedding for Scene Graph

Neural Information Processing Systems | Mar-16-2026, 20:56:07 GMT

Objects and their relationships are critical contents for image understanding. A scene graph provides a structured description that captures these properties of an image. However, reasoning about the relationships between objects is very challenging and only a few recent works have attempted to solve the problem of generating a scene graph from an image. In this paper, we present a novel method that improves scene graph generation by explicitly modeling inter-dependency among the entire object instances. We design a simple and effective relational embedding module that enables our model to jointly represent connections among all related objects, rather than focus on an object in isolation. Our novel method significantly benefits two main parts of the scene graph generation task: object classification and relationship classification. Using it on top of a basic Faster R-CNN, our model achieves state-of-the-art results on the Visual Genome benchmark.

artificial intelligence, name change, proceedings, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.40)

ee74a6ade401e200985e2421b20bbae4-Paper-Conference.pdf

Neural Information Processing Systems | Feb-18-2026, 15:14:29 GMT

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Asia > China > Shaanxi Province > Xi'an (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

4D Panoptic Scene Graph Generation Jingkang Yang

Neural Information Processing Systems | Feb-17-2026, 11:59:36 GMT

Traditional 3D scene graph methods may recognize the static elements of this scene, such as identifying a booth situated on the ground. However, a more ideal, advanced, and dynamic perception is required for real-world scenarios.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel (0.04)
Asia > China > Hong Kong (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos

Neural Information Processing Systems | Feb-17-2026, 04:23:13 GMT

In this paper, we introduce the new Aero-Eye dataset that focuses on multi-object relationship modeling in aerial videos.

artificial intelligence, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Arkansas (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry:

Leisure & Entertainment > Sports (0.92)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(3 more...)

9ca825deb6ce588c96f880728d3b8aea-Paper-Conference.pdf

Neural Information Processing Systems | Feb-16-2026, 03:41:39 GMT

category, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

LinkNet: Relational Embedding for Scene Graph

Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon

Neural Information Processing Systems | Feb-12-2026, 21:21:17 GMT

Neural Information Processing Systems http://nips.cc/

computer vision, module, relational, (10 more...)

Neural Information Processing Systems

Country:

Asia > South Korea > Daejeon > Daejeon (0.05)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Iterative Scene Graph Generation

Neural Information Processing Systems | Feb-10-2026, 23:18:03 GMT

artificial intelligence, conference on computer vision and pattern recognition, machine learning, (10 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario (0.04)
North America > Canada > British Columbia (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Single-Stage Visual Relationship Learning using Conditional Queries

Neural Information Processing Systems | Feb-9-2026, 01:23:08 GMT

To address this, recent research attempts to train single-stage models that are computationally efficient.

artificial intelligence, detection, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Joint Modeling of Visual Objects and Relations for Scene Graph Generation (Supplementary Material)

Neural Information Processing Systems | Feb-8-2026, 08:36:30 GMT

Now, we can exactly derive that q(G) = ˆp(G|I). The definitions of the potential functions φ and ψ follow those in the JM-SGG model. Figure 1: The scene graphs generated by the JM-SGG model. In these examples, factor update is able to correct some wrong relation labels (e.g.

artificial intelligence, machine learning, triplet, (12 more...)

Neural Information Processing Systems

Country: