AITopics | Asia

Supplementary Materials for Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

Neural Information Processing SystemsApr-29-2026, 19:21:13 GMT

The details of multiple datasets for OIQA task are presented in Table A. For the dataset that contains scanpath coordinates, we can directly sample viewport sequences from it and use our network to predict the quality scores. However, it is challenging and costly to record user scanpath data for every ODI in realistic scenarios. The scanpath information is likely unavailable when evaluating the quality of a panorama. Therefore, we propose a generalized Recursive Probability Sampling (RPS) method to generate multiple pseudo viewport sequences for the panorama, which assists the network to predict an accurate quality score in a way that is similar to the observer's actual scoring process. In JUFE and JXUFE, each ODI consists of 300 viewport coordinates, recorded using a head-mounted display (HMD).

artificial intelligence, machine learning, quality assessment, (15 more...)

Neural Information Processing Systems

Country: Asia (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.72)

Add feedback

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

Neural Information Processing SystemsApr-29-2026, 19:21:09 GMT

Blind Omnidirectional Image Quality Assessment (BOIQA) aims to objectively assess the human perceptual quality of omnidirectional images (ODIs) without relying on pristine-quality image information. It is becoming more significant with the increasing advancement of virtual reality (VR) technology. However, the quality assessment of ODIs is severely hampered by the fact that the existing BOIQA pipeline lacks the modeling of the observer's browsing process. To tackle this issue, we propose a novel multi-sequence network for BOIQA called Assessor360, which is derived from the realistic multi-assessor ODI quality assessment procedure. Specifically, we propose a generalized Recursive Probability Sampling (RPS) method for the BOIQA task, combining content and details information to generate multiple pseudo viewport sequences from a given starting point.

artificial intelligence, human computer interaction, machine learning, (20 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

cba76ef96c4cd625631ab4d33285b045-Paper-Conference.pdf

Neural Information Processing SystemsApr-29-2026, 19:04:43 GMT

data mining, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: Asia > China (0.29)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

ca24eb48806df3af49e5ac59d8a46f67-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-29-2026, 18:35:12 GMT

artificial intelligence, machine learning, noise, (19 more...)

Neural Information Processing Systems

Country:

Europe > Russia (0.28)
Asia (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Sanctioned Chinese AI Firm SenseTime Releases Image Model Built for Speed

WIREDApr-29-2026, 17:23:52 GMT

With US restrictions limiting its access to advanced tech, SenseTime is doubling down on open source with a new model optimized to run on Chinese-made chips. SenseTime, a Chinese AI company best known for its facial recognition technology, released a new open source model on Tuesday that it claims can both generate and interpret images far faster than top models developed by US competitors. SenseNova U1 could help the company reclaim lost ground after it slipped from its place among the leading players in China's AI development race. The model's secret sauce is its ability to "read" images without translating them to text first, speeding up the process and reducing the amount of computing power required. "The model's entire reasoning process is no longer limited to text. It can reason with images as well," Dahua Lin, cofounder and chief scientist at SenseTime, said in an interview with WIRED.

large language model, machine learning, natural language, (15 more...)

WIRED

Country:

Asia > China (0.54)
North America > United States > California (0.15)

Industry: Information Technology > Security & Privacy (0.97)

Technology:

Information Technology > Artificial Intelligence > Robots (0.76)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Supplementary Material for Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieval

Neural Information Processing SystemsApr-29-2026, 16:14:26 GMT

This supplementary material provides further elaboration and discussion on our work, including additional details that support our findings.

artificial intelligence, machine learning, retrieval, (15 more...)

Neural Information Processing Systems

Country: Asia > China (0.15)

Genre: Research Report > New Finding (0.35)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.41)
Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

Re Think and Re Design Graph Neural Networks in Spaces of Continuous Graph Diffusion Functionals

Neural Information Processing SystemsApr-29-2026, 13:56:25 GMT

Graphs are ubiquitous in various domains, such as social networks and biological systems. Despite the great successes of graph neural networks (GNNs) in modeling and analyzing complex graph data, the inductive bias of locality assumption, which involves exchanging information only within neighboring connected nodes, restricts GNNs in capturing long-range dependencies and global patterns in graphs. Inspired by the classic Brachistochrone problem, we seek how to devise a new inductive bias for cutting-edge graph application and present a general framework through the lens of variational analysis. The backbone of our framework is a two-way mapping between the discrete GNN model and continuous diffusion functional, which allows us to design application-specific objective function in the continuous domain and engineer discrete deep model with mathematical guarantees. First, we address over-smoothing in current GNNs.

artificial intelligence, graph, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Asia (0.28)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection

Neural Information Processing SystemsApr-29-2026, 13:39:15 GMT

With the rapidly increasing demand for oriented object detection, e.g. in autonomous driving and remote sensing, the recently proposed paradigm involving weakly-supervised detector H2RBox for learning rotated box (RBox) from the more readily-available horizontal box (HBox) has shown promise. This paper presents H2RBox-v2, to further bridge the gap between HBox-supervised and RBox-supervised oriented object detection. Specifically, we propose to leverage the reflection symmetry via flip and rotate consistencies, using a weakly-supervised network branch similar to H2RBox, together with a novel self-supervised branch that learns orientations from the symmetry inherent in visual objects. The detector is further stabilized and enhanced by practical techniques to cope with peripheral issues e.g.

artificial intelligence, detection, machine learning, (14 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

UE4-NeRF: Neural Radiance Field for Real-Time Rendering of Large-Scale Scene

Neural Information Processing SystemsApr-29-2026, 13:39:00 GMT

Neural Radiance Field (NeRF) is an implicit 3D reconstruction method that has shown immense potential and has gained significant attention for its ability to reconstruct 3D scenes solely from a set of photographs. However, its real-time rendering capability, especially for interactive real-time rendering of large-scale scenes, has significant limitations. To address this challenge, we propose a novel neural rendering system called UE4-NeRF that is designed for real-time rendering of large-scale scenes. Our proposed approach partitions large scenes into subNeRFs, and uses polygonal meshes to represent them. In order to represent the partitioned independent scene, we initialize polygonal meshes by constructing multiple regular octahedra within the scene and the vertices of the polygonal faces are continuously optimized during the training process. Drawing inspiration from the Level of Detail (LOD) techniques, we train meshes with varying levels of detail for different observation levels. Our approach combines with the rasterization pipeline in Unreal Engine 4 (UE4), achieving real-time rendering of large-scale scenes at 4K resolution with a frame rate of up to 43 FPS. Our experimental results demonstrate that our method attains rendering quality on par with state-of-the-art approaches, while additionally offering the advantage of real-time performance.

machine learning, real time system, rendering, (13 more...)

Neural Information Processing Systems

Country: