AITopics

Industry: Health & Medicine > Therapeutic Area > Neurology (0.60)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (0.60)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Neural Information Processing Systems, Feb-9-2026, 16:05:45 GMT

25c0fe7b157821dd3140727dc07461da-Paper-Conference.pdf

appearance change, dataset, gaussian

Country:

Europe > Czechia > Prague (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Neural Information Processing Systems, Nov-20-2025, 22:57:28 GMT

Deep Attentive Tracking via Reciprocative Learning

deep attentive tracking, name change, reciprocative learning

Industry: Health & Medicine > Therapeutic Area > Neurology (0.60)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (0.60)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Neural Information Processing Systems, Oct-9-2025, 21:14:19 GMT

WildGaussians: 3D Gaussian Splatting in the Wild

The work was done during an academic visit to ETH Zurich.

appearance change, dataset, gaussian

Country:

Europe > Switzerland > Zürich > Zürich (0.24)
Europe > Czechia > Prague (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

arXiv.org Artificial Intelligence, Jul-17-2025

VISTA: Monocular Segmentation-Based Mapping for Appearance and View-Invariant Global Localization

Shafferman, Hannah, Thomas, Annika, Kinnari, Jouko, Ricard, Michael, Nino, Jose, How, Jonathan

Global localization is critical for autonomous navigation, particularly in scenarios where an agent must localize within a map generated in a different session or by another agent, as agents often have no prior knowledge about the correlation between reference frames. However, this task remains challenging in unstructured environments due to appearance changes induced by viewpoint variation, seasonal changes, spatial aliasing, and occlusions, all known failure modes for traditional place recognition methods. To address these challenges, we propose VISTA (View-Invariant Segmentation-Based Tracking for Frame Alignment), a novel open-set, monocular global localization framework that combines: 1) a front-end, object-based segmentation and tracking pipeline, followed by 2) a submap correspondence search, which exploits geometric consistencies between environment maps to align vehicle reference frames. VISTA enables consistent localization across diverse camera viewpoints and seasonal changes, without requiring any domain-specific training or finetuning. We evaluate VISTA on seasonal and oblique-angle aerial datasets, achieving up to a 69% improvement in recall over baseline methods. Furthermore, we maintain a compact object-based map that is only 0.6% of the size of the most memory-conservative baseline, making our approach capable of real-time implementation on resource-constrained platforms.

artificial intelligence, localization, machine learning

2507.11653

Country:

Europe (0.68)
North America > United States > Massachusetts (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Chen, Yuxuan, Xu, Binbin, Dümbgen, Frederike, Barfoot, Timothy D.

What to Learn: Features, Image Transformations, or Both?

arXiv.org Artificial Intelligence, Jun-22-2023

Long-term visual localization is an essential problem in robotics and computer vision, but remains challenging due to the environmental appearance changes caused by lighting and seasons. While many existing works have attempted to solve it by directly learning invariant sparse keypoints and descriptors to match scenes, these approaches still struggle with adverse appearance changes. Recent developments in image transformations such as neural style transfer have emerged as an alternative to address such appearance gaps. In this work, we propose to combine an image transformation network and a feature-learning network to improve long-term localization performance. Given night-to-day image pairs, the image transformation network transforms the night images into day-like conditions prior to feature matching; the feature network learns to detect keypoint locations with their associated descriptor values, which can be passed to a classical pose estimator to compute the relative poses. We conducted various experiments to examine the effectiveness of combining style transfer and feature learning and its training strategy, showing that such a combination greatly improves long-term localization performance.

artificial intelligence, featnet, machine learning

2306.1304

Country: North America > Canada > Ontario > Toronto (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing Systems, Apr-6-2023, 15:53:38 GMT

Incremental Learning for Visual Tracking

We recorded a sequence to demonstrate that our tracker performs well in an outdoor environment where lighting conditions change drastically. The video was acquired while a person was walking underneath a trellis covered by vines. As shown in Figure 3, the cast shadow changes the appearance of the target face drastically. Furthermore, the combined pose and lighting variation, together with the low frame rate, makes the tracking task extremely difficult. Nevertheless, the results show that our tracker follows the target accurately and robustly. Due to heavy shadows and drastic lighting change, other tracking methods based on gradient, contour, or color information are unlikely to perform well in this case.

algorithm, eigenspace representation, representation

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)

Chen, Yuxuan, Barfoot, Timothy D.

Self-Supervised Feature Learning for Long-Term Metric Visual Localization

arXiv.org Artificial Intelligence, Nov-30-2022

Visual localization is the task of estimating camera pose in a known scene, which is an essential problem in robotics and computer vision. However, long-term visual localization is still a challenge due to the environmental appearance changes caused by lighting and seasons. While techniques exist to address appearance changes using neural networks, these methods typically require ground-truth pose information to generate accurate image correspondences or act as a supervisory signal during training. In this paper, we present a novel self-supervised feature learning framework for metric visual localization. We use a sequence-based image matching algorithm across different sequences of images (i.e., experiences) to generate image correspondences without ground-truth labels. We can then sample image pairs to train a deep neural network that learns sparse features with associated descriptors and scores without ground-truth pose supervision. The learned features can be used together with a classical pose estimator for visual stereo localization. We validate the learned features by integrating with an existing Visual Teach & Repeat pipeline to perform closed-loop localization experiments under different lighting conditions for a total of 22.4 km.

artificial intelligence, image correspondence, machine learning

2212.00122

Country: North America > Canada > Ontario > Toronto (0.14)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Kinnari, Jouko, Verdoja, Francesco, Kyrki, Ville

Season-invariant GNSS-denied visual localization for UAVs

arXiv.org Artificial Intelligence, Aug-3-2022

Localization without Global Navigation Satellite Systems (GNSS) is a critical functionality in autonomous operations of unmanned aerial vehicles (UAVs). Vision-based localization on a known map can be an effective solution, but it is burdened by two main problems: places look different depending on weather and season, and the perspective discrepancy between the UAV camera image and the map makes matching hard. In this work, we propose a localization solution that matches UAV camera images to georeferenced orthophotos using a trained convolutional neural network model that is invariant to significant seasonal appearance differences (winter-summer) between the camera image and the map. We compare the convergence speed and localization accuracy of our solution to six reference methods. The results show major improvements with respect to reference methods, especially under high seasonal variation. We finally demonstrate the ability of the method to successfully localize a real UAV, showing that the proposed method is robust to perspective changes.

experiment, localization, similarity measure

doi: 10.1109/LRA.2022.3191038

2110.01967

Country:

Europe > Sweden (0.04)
Europe > Finland > Uusimaa > Helsinki (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Information Technology > Robotics & Automation (0.48)
Aerospace & Defense > Aircraft (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

#artificialintelligence, Nov-2-2020, 03:45:06 GMT

Generate Younger & Older Versions of Yourself!

A team of researchers from Adobe Research developed a new technique for age transformation synthesis based on only one picture of the person. It can generate lifespan pictures from any picture you send it. Just watch how good their results are compared to the previous state-of-the-art methods. The results are highly realistic, and every picture really does seem to show the same person at different ages. This is typically called the problem of single-photo age progression and regression, where the goal is to predict how a person might look in the future, or how they looked in the past.

artificial intelligence, generate younger & older version, machine learning

#artificialintelligence

Genre: Research Report > Promising Solution (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.36)