Image Understanding


A Multimodal Task Details

Neural Information Processing Systems

Table 4 shows details about the individual multimodal tasks, including the hyperparameters used to train ViLT for each task and how low-shot versions of each task are sampled. The 4 output labels in VCR are not semantically meaningful (the options are interchangeable); hence, instead of sampling an equal number of training samples per label, we sample a percentage of the full training data. For VQAv2, the output label space is very large and answers are not uniformly distributed across the training data, so instead of sampling N shots per output label (answer), we again sample a percentage of the full VQAv2 training data.

B.1 Applying ViLT to Multi-Choice Tasks

B.1.1 Applying ViLT to VCR

The VCR task provides object boxes, with each box corresponding to a grounded entity in the question. Unlike other pre-trained vision-language encoders [Su et al., 2019, Chen et al., 2020] that use visual features from regions of interest (ROIs) in the image, ViLT is designed to operate over image patches, making it challenging to use the object box inputs provided in the VCR task.
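The two low-shot sampling strategies described above are easy to make concrete. The sketch below is ours, not the paper's released code; the function names and the (input, label) list representation are illustrative assumptions.

```python
import random
from collections import defaultdict

def sample_n_shots_per_label(examples, n, seed=0):
    """N-shot sampling: keep at most n examples per output label.

    `examples` is a list of (input, label) pairs. Suitable for tasks
    with a small, roughly balanced label space."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for ex in examples:
        by_label[ex[1]].append(ex)
    subset = []
    for _, items in by_label.items():
        rng.shuffle(items)
        subset.extend(items[:n])
    return subset

def sample_percentage(examples, fraction, seed=0):
    """Percentage sampling: keep a fixed fraction of the full training set.

    Used when labels are interchangeable (VCR) or the answer space is
    large and imbalanced (VQAv2), where per-label N-shot sampling is
    ill-posed."""
    rng = random.Random(seed)
    k = max(1, int(len(examples) * fraction))
    return rng.sample(examples, k)
```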


A Image Classification

Neural Information Processing Systems

To verify the effectiveness of PABEE on computer vision, we follow the experimental settings of Shallow-Deep [5] and conduct experiments on two image classification datasets, CIFAR-10 and CIFAR-100 [55]. We use ResNet-56 [10] as the backbone and compare PABEE with BranchyNet [26] and Shallow-Deep [5]. An internal classifier is added after every two convolutional layers. We set the batch size to 128 and use the SGD optimizer with a learning rate of 0.1.

Table 6: Experimental results (median of 5 runs) of ResNet-based models on the CIFAR-10 and CIFAR-100 datasets.
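To make the setup concrete, here is a minimal PyTorch sketch of patience-based early exiting with an internal classifier after every two conv layers. The widths and depth are illustrative, not the ResNet-56 [10] configuration, and the exit rule is our reading of PABEE's patience criterion.

```python
import torch
import torch.nn as nn

class EarlyExitCNN(nn.Module):
    """Toy CNN with a PABEE-style internal classifier after every
    two convolutional layers (sizes are illustrative)."""
    def __init__(self, num_classes=10, width=16, num_blocks=4):
        super().__init__()
        self.blocks, self.exits = nn.ModuleList(), nn.ModuleList()
        in_ch = 3
        for _ in range(num_blocks):
            self.blocks.append(nn.Sequential(
                nn.Conv2d(in_ch, width, 3, padding=1), nn.ReLU(),
                nn.Conv2d(width, width, 3, padding=1), nn.ReLU()))
            self.exits.append(nn.Sequential(
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                nn.Linear(width, num_classes)))
            in_ch = width

    def forward(self, x, patience=2):
        # Exit once `patience` consecutive internal classifiers agree.
        # For simplicity this compares whole-batch predictions; use
        # batch size 1 for true per-sample early exiting.
        prev, streak, logits = None, 0, None
        for block, head in zip(self.blocks, self.exits):
            x = block(x)
            logits = head(x)
            pred = logits.argmax(dim=1)
            if prev is not None and torch.equal(pred, prev):
                streak += 1
                if streak >= patience:
                    return logits
            else:
                streak = 0
            prev = pred
        return logits
```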


Dynamo-Depth: Fixing Unsupervised Depth Estimation for Dynamical Scenes

Neural Information Processing Systems

Unsupervised monocular depth estimation techniques have demonstrated encouraging results but typically assume that the scene is static. These techniques suffer when trained on dynamical scenes, where apparent object motion can equally be explained by hypothesizing the object's independent motion or by altering its depth. This ambiguity causes depth estimators to predict erroneous depth for moving objects. To resolve this issue, we introduce Dynamo-Depth, a unifying approach that disambiguates dynamical motion by jointly learning monocular depth, a 3D independent flow field, and motion segmentation from unlabeled monocular videos. Specifically, we offer the key insight that a good initial estimate of motion segmentation is sufficient for jointly learning depth and independent motion despite the fundamental underlying ambiguity. Our proposed method achieves state-of-the-art monocular depth estimation performance on the Waymo Open [34] and nuScenes [3] datasets, with significant improvement in the depth of moving objects. Code and additional results are available at https://dynamo-depth.github.io.
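The decomposition into rigid ego-motion flow plus a segmentation-gated independent flow can be sketched as below. The notation and function are ours, for illustration only, and not the paper's exact formulation.

```python
import numpy as np

def compose_scene_flow(depth, K, ego_T, indep_flow, motion_mask):
    """Illustrative decomposition: per-pixel 3D motion is the rigid flow
    induced by camera ego-motion plus an independent flow field gated by
    a motion segmentation mask.

    depth: (H, W), K: (3, 3) intrinsics, ego_T: (4, 4) relative pose,
    indep_flow: (3, H, W), motion_mask: (H, W) in [0, 1]."""
    H, W = depth.shape
    # Back-project pixels to 3D camera coordinates.
    u, v = np.meshgrid(np.arange(W), np.arange(H))
    rays = np.linalg.inv(K) @ np.stack([u, v, np.ones_like(u)], 0).reshape(3, -1)
    pts = rays * depth.reshape(1, -1)                    # (3, H*W)
    # Rigid flow from camera ego-motion.
    pts_h = np.vstack([pts, np.ones((1, pts.shape[1]))])
    rigid = (ego_T @ pts_h)[:3] - pts
    # Add independent object motion only where the mask says "moving".
    total = rigid + motion_mask.reshape(1, -1) * indep_flow.reshape(3, -1)
    return (pts + total).reshape(3, H, W)                # moved 3D points
```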


Forget About the LiDAR: Self-Supervised Depth Estimators with MED Probability Volumes

Neural Information Processing Systems

Self-supervised depth estimators have recently shown results comparable to supervised methods on the challenging single image depth estimation (SIDE) task, by exploiting the geometric relations between target and reference views in the training data. However, previous methods usually learn forward or backward image synthesis, but not depth estimation, as they cannot effectively neglect occlusions between the target and the reference images. Previous works rely on rigid photometric assumptions or on the SIDE network to infer depth and occlusions, resulting in limited performance. In contrast, we propose a method to "Forget About the LiDAR" (FAL), with Mirrored Exponential Disparity (MED) probability volumes, for training monocular depth estimators from stereo images. Our MED representation allows us to obtain geometrically inspired occlusion maps with our novel Mirrored Occlusion Module (MOM), which does not impose a learning burden on our FAL-net.
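Two small helpers illustrate the flavor of an exponential disparity volume; these are our sketches of the general technique, not the paper's exact MED construction, which additionally mirrors the volume for occlusion reasoning.

```python
import numpy as np

def exponential_disparity_levels(d_min, d_max, n=33):
    """Exponentially spaced disparity levels (d_min > 0): a geometric
    progression allocates more bins to near, large-disparity regions
    than uniform spacing would."""
    t = np.linspace(0.0, 1.0, n)
    return d_min * (d_max / d_min) ** t

def expected_disparity(prob_volume, levels):
    """Soft-argmax over an (N, H, W) probability volume: the predicted
    disparity is the probability-weighted sum of the discrete levels."""
    return np.tensordot(levels, prob_volume, axes=(0, 0))  # (H, W)
```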


Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets

Neural Information Processing Systems

In this work, we introduce Scribbles for All, a label and training data generation algorithm for semantic segmentation trained on scribble labels. Training or fine-tuning semantic segmentation models with weak supervision has become an important topic and has recently seen significant advances in model quality. In this setting, scribbles are a promising label type for achieving high-quality segmentation results while requiring far less annotation effort than the usual pixel-wise dense semantic segmentation annotations. The main limitation of scribbles as a source of weak supervision is the lack of challenging datasets for scribble segmentation, which hinders the development of novel methods and conclusive evaluations. To overcome this limitation, Scribbles for All provides scribble labels for several popular segmentation datasets and an algorithm to automatically generate scribble labels for any dataset with dense annotations, paving the way for new insights and model advancements in the field of weakly supervised segmentation. In addition to providing the datasets and the algorithm, we evaluate state-of-the-art segmentation models on our datasets and show that models trained with our synthetic labels perform competitively with models trained on manual labels. Our datasets thus enable state-of-the-art research into methods for scribble-labeled semantic segmentation.
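For intuition, a common baseline for deriving scribbles from dense masks is to skeletonize each class region; the released algorithm is more elaborate, so the sketch below is only an assumed simplification of the idea.

```python
import numpy as np
from skimage.morphology import skeletonize

def scribbles_from_dense(label_map, ignore_index=255):
    """Turn a dense (H, W) integer annotation into sparse scribble
    supervision by skeletonizing each class region. Pixels off the
    scribble are set to `ignore_index` so the training loss skips them."""
    scribble = np.full_like(label_map, ignore_index)
    for cls in np.unique(label_map):
        if cls == ignore_index:
            continue
        skeleton = skeletonize(label_map == cls)  # thin medial-axis strokes
        scribble[skeleton] = cls
    return scribble
```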


AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation

Neural Information Processing Systems

Open-vocabulary semantic segmentation is a challenging task that requires segmenting novel object categories at inference time. Recent works explore vision-language pre-training to handle this task but suffer from an unrealistic assumption in practical scenarios, i.e., that textual category names are always of high quality. This paradigm assumes that new textual categories are provided accurately and completely, and that they exist in the lexicons used during pre-training. However, exceptions often arise: brief or incomplete names can be ambiguous, new words may be absent from the pre-trained lexicons, and some categories are difficult for users to describe. To address these issues, this work proposes AttrSeg, a novel attribute decomposition-aggregation framework inspired by human cognition in understanding new concepts.
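The decomposition-aggregation idea can be shown in a few lines. This is a hedged sketch: `embed_text` stands in for a vision-language text encoder (e.g., CLIP), and mean pooling is the simplest proxy for the paper's learned hierarchical aggregation.

```python
import numpy as np

def aggregate_attribute_embedding(attributes, embed_text):
    """Describe a hard-to-name category by attribute phrases, embed each
    with a text encoder, and aggregate into one classifier embedding.

    attributes: list of strings, e.g. ["striped fur", "four legs"].
    embed_text: callable str -> (D,) numpy vector (assumed)."""
    vecs = np.stack([embed_text(a) for a in attributes])  # (A, D)
    agg = vecs.mean(axis=0)
    return agg / np.linalg.norm(agg)  # unit-norm query for pixel features
```

A pixel is then scored against this aggregated embedding (e.g., by cosine similarity with its visual feature), exactly as a plain category-name embedding would be used.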


ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images

Neural Information Processing Systems

Open-vocabulary 3D object detection (OV-3Det) aims to generalize beyond the limited number of base categories labeled during the training phase. The biggest bottleneck is the scarcity of annotated 3D data, whereas 2D image datasets are abundant and richly annotated. Consequently, it is intuitive to leverage the wealth of annotations in 2D images to alleviate the inherent data scarcity in OV-3Det. In this paper, we push the task setup to its limits by exploring the potential of using solely 2D images to learn OV-3Det. The major challenge for this setup is the modality gap between training images and testing point clouds, which prevents the effective integration of 2D knowledge into OV-3Det.
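One common way to bridge this image-to-point-cloud modality gap is to lift images into pseudo point clouds via a (monocular) depth map. The sketch below illustrates that general technique under our own assumptions; it is not necessarily the paper's exact pipeline.

```python
import numpy as np

def pseudo_point_cloud(depth, K, subsample=4):
    """Lift an (H, W) depth map into an (N, 3) pseudo point cloud using
    pinhole intrinsics K, so a 3D detector can be trained from 2D images."""
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(0, W, subsample), np.arange(0, H, subsample))
    z = depth[v, u]
    x = (u - K[0, 2]) * z / K[0, 0]
    y = (v - K[1, 2]) * z / K[1, 1]
    pts = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    return pts[pts[:, 2] > 0]  # drop invalid (zero-depth) pixels
```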



Parameter-Inverted Image Pyramid Networks

Neural Information Processing Systems

Image pyramids are commonly used in modern computer vision tasks to obtain multi-scale features for precise understanding of images. However, image pyramids process multiple resolutions of an image with the same large-scale model, which incurs significant computational cost. To overcome this issue, we propose a novel network architecture known as Parameter-Inverted Image Pyramid Networks (PIIP). Our core idea is to use models with different parameter sizes to process different resolution levels of the image pyramid, thereby balancing computational efficiency and performance. Specifically, the input to PIIP is a set of multi-scale images, where higher-resolution images are processed by smaller networks. We further propose a feature interaction mechanism that allows features of different resolutions to complement each other and effectively integrate information from different spatial scales. Extensive experiments demonstrate that PIIP achieves superior performance in tasks such as object detection, segmentation, and image classification compared to traditional image pyramid methods and single-branch networks, while reducing computational cost. Notably, when applying our method to the large-scale vision foundation model InternViT-6B, we improve its performance by 1%-2% on detection and segmentation with only 40%-60% of the original computation. These results validate the effectiveness of the PIIP approach and provide a new technical direction for future vision computing tasks.
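The parameter-inverted pairing is easy to demonstrate with a toy model: the largest branch sees the lowest resolution and vice versa. The convolutional branches and summation fusion below are illustrative stand-ins for the paper's ViT branches and interaction units.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ParameterInvertedPyramid(nn.Module):
    """Toy PIIP-style model: branch widths are inverted with respect to
    input resolution (widths[0], the widest branch, gets the smallest
    image). Sizes and fusion are illustrative assumptions."""
    def __init__(self, widths=(96, 48, 24), out_dim=128):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(3, w, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(w, out_dim, 3, stride=2, padding=1))
            for w in widths])

    def forward(self, images):
        # `images` sorted low -> high resolution; big nets see small images.
        feats = [b(x) for b, x in zip(self.branches, images)]
        size = feats[-1].shape[-2:]  # align all maps to the finest one
        feats = [F.interpolate(f, size=size, mode="bilinear",
                               align_corners=False) for f in feats]
        return torch.stack(feats).sum(0)  # simple cross-scale fusion

# Usage: three resolutions of the same image, coarse to fine.
model = ParameterInvertedPyramid()
images = [torch.randn(1, 3, s, s) for s in (64, 128, 256)]
out = model(images)  # (1, 128, 64, 64) fused multi-scale features
```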


SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection

Neural Information Processing Systems

Synthetic Aperture Radar (SAR) object detection has gained significant attention recently due to its irreplaceable all-weather imaging capabilities. However, this research field suffers from both limited public datasets (mostly comprising <2K images with only mono-category objects) and inaccessible source code. To tackle these challenges, we establish a new benchmark dataset and an open-source method for large-scale SAR object detection. Our dataset, SARDet-100K, is the result of intensively surveying, collecting, and standardizing 10 existing SAR detection datasets, providing a large-scale and diverse dataset for research purposes. To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created. With this high-quality dataset, we conducted comprehensive experiments and uncovered a crucial challenge in SAR object detection: the substantial disparities between pretraining on RGB datasets and fine-tuning on SAR datasets in terms of both data domain and model structure. To bridge these gaps, we propose a novel Multi-Stage with Filter Augmentation (MSFA) pretraining framework that tackles the problems from the perspectives of data input, domain transition, and model migration. The proposed MSFA method significantly enhances the performance of SAR object detection models while demonstrating excellent generalizability and flexibility across diverse models. This work aims to pave the way for further advancements in SAR object detection.
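The filter-augmentation side of the idea, feeding handcrafted filter responses instead of raw channels so that an RGB-pretrained backbone sees texture statistics closer to SAR imagery, can be sketched as follows. The specific filters here are our illustrative choices, not the paper's MSFA filter bank or its multi-stage schedule.

```python
import numpy as np
from scipy import ndimage

def filter_augment(gray):
    """Map a single-channel (H, W) image to three handcrafted filter
    responses, stacked as a (3, H, W) input so it slots into a
    3-channel RGB-pretrained backbone."""
    gx = ndimage.sobel(gray, axis=1)         # horizontal gradients
    gy = ndimage.sobel(gray, axis=0)         # vertical gradients
    lo = ndimage.gaussian_filter(gray, 2.0)  # low-frequency band
    return np.stack([gx, gy, lo], axis=0)
```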