STREETS: A Novel Camera Network Dataset for Traffic Flow

Corey Snyder, Minh Do

Neural Information Processing Systems

In this paper, we introduce STREETS, a novel traffic flow dataset from publicly available web cameras in the suburbs of Chicago, IL. We seek to address the limitations of existing datasets in this area. Many such datasets lack a coherent traffic network graph to describe the relationship between sensors.
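A traffic network graph of the kind the abstract describes, with sensors as nodes and road links as directed edges, might be sketched as follows; the helper names and toy topology are illustrative, not part of the dataset:

```python
# Hypothetical sketch: a camera network as a directed graph, where nodes are
# camera/sensor sites and edges are road segments linking them. The example
# topology and camera IDs are invented for illustration.
from collections import defaultdict

def build_network(edges):
    """Build an adjacency-list graph from (upstream, downstream) sensor pairs."""
    graph = defaultdict(list)
    for src, dst in edges:
        graph[src].append(dst)
    return dict(graph)

def downstream_of(graph, sensor):
    """Return the sensors directly downstream of the given sensor."""
    return graph.get(sensor, [])

# Toy network: camera A feeds into B and C; B also feeds into C.
network = build_network([("cam_A", "cam_B"), ("cam_A", "cam_C"), ("cam_B", "cam_C")])
```

Such a graph makes the relationship between sensors explicit, e.g. which cameras observe traffic that will later pass another camera.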



The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition

Brookes, Otto, Kukushkin, Maksim, Mirmehdi, Majid, Stephens, Colleen, Dieguez, Paula, Hicks, Thurston C., Jones, Sorrel, Lee, Kevin, McCarthy, Maureen S., Meier, Amelia, Normand, Emmanuelle, Wessling, Erin G., Wittig, Roman M., Langergraber, Kevin, Zuberbühler, Klaus, Boesch, Lukas, Schmid, Thomas, Arandjelovic, Mimi, Kühl, Hjalmar, Burghardt, Tilo

arXiv.org Artificial Intelligence

Computer vision analysis of camera trap video footage is essential for wildlife conservation, as captured behaviours offer some of the earliest indicators of changes in population health. Recently, several high-impact animal behaviour datasets and methods have been introduced to encourage their use; however, the role of behaviour-correlated background information and its significant effect on out-of-distribution generalisation remain unexplored. In response, we present the PanAf-FGBG dataset, featuring 20 hours of wild chimpanzee behaviours recorded at over 350 individual camera locations. Uniquely, it pairs every video containing a chimpanzee (referred to as a foreground video) with a corresponding background video (with no chimpanzee) from the same camera location. We present two views of the dataset: one with overlapping camera locations and one with disjoint locations. This setup enables, for the first time, direct evaluation under in-distribution and out-of-distribution conditions, and allows the impact of backgrounds on behaviour recognition models to be quantified. All clips come with rich behavioural annotations and metadata, including unique camera IDs and detailed textual scene descriptions. Additionally, we establish several baselines and present a highly effective latent-space normalisation technique that boosts out-of-distribution performance by +5.42% mAP for convolutional and +3.75% mAP for transformer-based models. Finally, we provide an in-depth analysis of the role of backgrounds in out-of-distribution behaviour recognition, including the previously unexplored impact of background durations (i.e., the count of background frames within foreground videos).
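The abstract does not specify the latent-space normalisation technique. One common form, per-sample standardisation of feature embeddings, can be sketched as below; this is an assumption about the general idea, not the paper's exact method:

```python
import numpy as np

def normalize_latents(z, eps=1e-6):
    """Standardize each latent vector to zero mean and unit variance across its
    feature dimension. Intuition (assumed, not from the paper): a constant
    background-induced offset in a clip's embedding is removed by the shift."""
    mu = z.mean(axis=-1, keepdims=True)
    sigma = z.std(axis=-1, keepdims=True)
    return (z - mu) / (sigma + eps)

# Two toy clip embeddings that differ only by a constant background shift of +5.
z = np.array([[0.2, 1.5, -0.7],
              [5.2, 6.5,  4.3]])
z_norm = normalize_latents(z)
```

After normalisation the two embeddings coincide, which illustrates why such a step could help out-of-distribution generalisation across camera backgrounds.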


Reviews: PerspectiveNet: A Scene-consistent Image Generator for New View Synthesis in Real Indoor Environments

Neural Information Processing Systems

Given a few RGBD images of a real indoor scene, along with the camera locations where they were taken, the algorithm predicts RGBD images taken from different camera locations. The novelty is the use of a denoising auto-encoder for a given view and the discovery of latent representations that are consistent across different views. Detailed comments: - It would be good if the whole process were described in steps, because it wasn't clear from the start what the overall approach is (maybe it would be for someone working on a similar topic). Some figures are good, but could be better together with such a description. Something like the following would be useful for me: A) We are given a set of RGBD views along with camera locations of a given scene.


Dataset of polarimetric images of mechanically generated water surface waves coupled with surface elevation records by wave gauges linear array

Ginio, Noam, Lindenbaum, Michael, Fishbain, Barak, Liberzon, Dan

arXiv.org Artificial Intelligence

Effective spatio-temporal measurements of water surface elevation (water waves) in laboratory experiments are essential for scientific and engineering research. Existing techniques are often cumbersome, computationally heavy and generally suffer from limited wavenumber/frequency response. To address these challenges, a novel method was developed using a polarization-filter-equipped camera as the main sensor and Machine Learning (ML) algorithms for data processing [1,2]. Training and evaluation of the developed method were based on an in-house supervised dataset. Here we present this supervised dataset of polarimetric images of the water surface, coupled with water surface elevation measurements made by a linear array of resistance-type wave gauges (WGs). The water waves were mechanically generated in a laboratory wave basin, and the polarimetric images were captured under an artificial light source. Meticulous camera and WG calibration and instrument synchronization supported high spatio-temporal resolution. The dataset covers several wavefield conditions, from simple monochromatic wave trains of various steepnesses to irregular wavefields of JONSWAP-prescribed spectral shape and several wave-breaking scenarios. The dataset contains measurements repeated at several camera positions relative to the wave field propagation direction.


Wave (from) Polarized Light Learning (WPLL) method: high resolution spatio-temporal measurements of water surface waves in laboratory setups

Ginio, Noam, Lindenbaum, Michael, Fishbain, Barak, Liberzon, Dan

arXiv.org Artificial Intelligence

Effective spatio-temporal measurements of water surface elevation (water waves) in laboratory experiments are essential for scientific and engineering research. Existing techniques are often cumbersome, computationally heavy and generally suffer from limited wavenumber/frequency response. To address this challenge, we propose Wave (from) Polarized Light Learning (WPLL), a learning-based remote sensing method for laboratory implementation, capable of inferring surface elevation and slope maps at high resolution. The method uses the polarization properties of the light reflected from the water surface. WPLL uses a deep neural network (DNN) model that approximates the water surface slopes from the polarized light intensities. Once trained on simple monochromatic wave trains, WPLL is capable of producing high-resolution and accurate reconstructions of the 2D water surface slopes and elevation in a variety of irregular wave fields. The method's robustness is demonstrated by showcasing its high wavenumber/frequency response, its ability to reconstruct wave fields propagating at arbitrary angles relative to the camera's optical axis, and its computational efficiency. The developed methodology is an accurate and cost-effective near-real-time remote sensing tool for laboratory water surface wave measurements, paving the way for upscaling to open-sea applications in research, monitoring, and short-term forecasting.
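The core mapping, a DNN that approximates water surface slopes from polarized light intensities, can be sketched as a tiny multilayer perceptron. The four-channel input (e.g. 0/45/90/135 degree polarization filters), layer sizes, and random weights below are illustrative assumptions, not the paper's architecture:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_forward(intensities, W1, b1, W2, b2):
    """Tiny MLP regressor: per-pixel polarized intensity features
    -> two surface-slope components (slope_x, slope_y)."""
    h = np.tanh(intensities @ W1 + b1)  # hidden layer
    return h @ W2 + b2                  # linear output: (slope_x, slope_y)

# Illustrative dimensions: 4 polarization channels -> 16 hidden units -> 2 slopes.
W1 = rng.normal(size=(4, 16)); b1 = np.zeros(16)
W2 = rng.normal(size=(16, 2)); b2 = np.zeros(2)

# A batch of 10 pixels, each with 4 polarized intensity readings.
slopes = mlp_forward(rng.normal(size=(10, 4)), W1, b1, W2, b2)
```

In the actual method, the predicted slope maps would then be integrated to recover the elevation field; that reconstruction step is not shown here.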


Towards Learning Monocular 3D Object Localization From 2D Labels using the Physical Laws of Motion

Kienzle, Daniel, Lorenz, Julian, Ludwig, Katja, Lienhart, Rainer

arXiv.org Artificial Intelligence

We present a novel method for precise 3D object localization in single images from a single calibrated camera, using only 2D labels. No expensive 3D labels are needed. Instead of 3D labels, our model is trained with easy-to-annotate 2D labels along with physical knowledge of the object's motion. Given this information, the model can infer the latent third dimension, even though it has never seen this information during training. Our method is evaluated on both synthetic and real-world datasets, and we achieve a mean distance error of just 6 cm in our experiments on real data. The results indicate the method's potential as a step towards learning 3D object location estimation in settings where collecting 3D training data is not feasible.
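The described supervision scheme, matching 2D reprojections of a predicted 3D trajectory while constraining it with the physical laws of motion, can be roughly sketched as follows. The pinhole intrinsics, the free-fall (constant-gravity) motion model, and all numeric values are illustrative assumptions, not the paper's actual formulation:

```python
import numpy as np

def project(p, f=1000.0, cx=640.0, cy=360.0):
    """Pinhole projection of a camera-frame 3D point (x, y, z) to pixels.
    Intrinsics are made-up example values."""
    x, y, z = p
    return np.array([f * x / z + cx, f * y / z + cy])

def physics_loss(points_3d, labels_2d, dt, g=9.81):
    """Combine 2D reprojection error with a free-fall consistency term:
    the finite-difference vertical acceleration should match gravity.
    (Assumes y grows downward in camera coordinates.)"""
    reproj = sum(np.sum((project(p) - l) ** 2)
                 for p, l in zip(points_3d, labels_2d))
    y = np.array([p[1] for p in points_3d])
    accel = (y[2:] - 2 * y[1:-1] + y[:-2]) / dt ** 2  # second difference
    return reproj + np.sum((accel - g) ** 2)

# Toy trajectory consistent with free fall; its own projections as 2D labels.
dt = 0.1
ts = [0.0, 0.1, 0.2, 0.3]
points = [np.array([0.1, 1.0 + 0.5 * 9.81 * t ** 2, 2.0]) for t in ts]
labels = [project(p) for p in points]
loss = physics_loss(points, labels, dt)
```

The key point is that the motion term depends on the latent depth-consistent 3D trajectory, so minimizing both terms jointly can pin down the third dimension that 2D labels alone cannot.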


LENS: Localization enhanced by NeRF synthesis

Moreau, Arthur, Piasco, Nathan, Tsishkou, Dzmitry, Stanciulescu, Bogdan, de La Fortelle, Arnaud

arXiv.org Artificial Intelligence

Neural Radiance Fields (NeRF) have recently demonstrated photo-realistic results for the task of novel view synthesis. In this paper, we propose to apply novel view synthesis to the robot relocalization problem: we demonstrate improved camera pose regression thanks to an additional synthetic dataset rendered by the NeRF class of algorithms. To avoid spawning novel views in irrelevant places, we select virtual camera locations from NeRF's internal representation of the 3D geometry of the scene. We further improve the localization accuracy of pose regressors by using synthesized, realistic, and geometry-consistent images as data augmentation during training. At the time of publication, our approach improved on the state of the art, with a 60% lower error on the Cambridge Landmarks and 7-Scenes datasets. Hence, the resulting accuracy becomes comparable to structure-based methods, without any architecture modification or domain adaptation constraints. Since our method allows almost infinite generation of training data, we investigated the limitations of camera pose regression depending on the size and distribution of the data used for training on public benchmarks. We concluded that pose regression accuracy is mostly bounded by relatively small and biased datasets rather than by the capacity of the pose regression model to solve the localization task.
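The idea of selecting virtual camera locations from NeRF's internal 3D representation can be loosely illustrated with a coarse occupancy grid standing in for the learned density field; the function name, density threshold, and grid below are hypothetical:

```python
import numpy as np

def select_virtual_cameras(candidates, occupancy, voxel_size=0.5):
    """Keep candidate camera positions that fall in free space of a coarse
    occupancy grid (a stand-in for NeRF's density field). `occupancy` maps
    integer voxel indices to a density in [0, 1]; missing voxels are free."""
    kept = []
    for pos in candidates:
        idx = tuple((pos // voxel_size).astype(int))
        if occupancy.get(idx, 0.0) < 0.5:  # low density -> free space
            kept.append(pos)
    return kept

# Toy scene: voxel (0, 0, 0) is occupied by geometry.
occupancy = {(0, 0, 0): 1.0}
candidates = [np.array([0.1, 0.1, 0.1]),   # inside the occupied voxel
              np.array([3.0, 3.0, 3.0])]   # in free space
kept = select_virtual_cameras(candidates, occupancy)
```

Rejecting candidates inside (or too close to) geometry is one plausible way to avoid spawning novel views in irrelevant places before rendering them for pose-regressor training.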