AITopics | appearance model

06997f04a7db92466a2baa6ebc8b872d-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 12:12:07 GMT

Tracking objects of interest in a video is one of the most popular and widely applicable problems in computer vision. However, with the years, a Cambrian explosion of use cases and benchmarks has fragmented the problem in a multitude of different experimental setups. As a consequence, the literature has fragmented too, and now novel approaches proposed by the community are usually specialised to fit only one specific setup. To understand to what extent this specialisation is necessary, in this work we present UniTrack, a solution to address five different tasks within the same framework. UniTrack consists of a single and task-agnostic appearance model, which can be learned in a supervised or self-supervised fashion, and multiple "heads" that address individual tasks and do not require training. We show how most tracking tasks can be solved within this framework, and that the same appearance model can be successfully used to obtain results that are competitive against specialised methods for most of the tasks considered. The framework also allows us to analyse appearance models obtained with the most recent self-supervised methods, thus extending their evaluation and comparison to a larger variety of important problems.

artificial intelligence, machine learning, representation, (18 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

06997f04a7db92466a2baa6ebc8b872d-Supplemental.pdf

Neural Information Processing SystemsFeb-18-2026, 21:53:51 GMT

cvpr, dataset, detection, (17 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Sensing and Signal Processing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

a29f0fc2127c9d8cb1c9d86e423241af-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 03:09:51 GMT

information, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Asia > China > Shaanxi Province (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.93)

Add feedback

DoDifferentTrackingTasksRequire DifferentAppearanceModels?

Neural Information Processing SystemsFeb-7-2026, 08:17:42 GMT

Tracking objects of interest in a video is one of the most popular and widely applicable problems in computer vision.

artificial intelligence, incvpr, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.88)

Add feedback

Revisiting Motion Information for RGB-Event Tracking with MOT Philosophy

Neural Information Processing SystemsOct-10-2025, 11:54:02 GMT

Simultaneously, a Dual-Branch Transformer Decoder is designed to adopt such motion and appearance information for candidate matching, thus distinguishing between targets and distractors.

appearance model, information, tracker, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Shaanxi Province (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.93)

Add feedback

06997f04a7db92466a2baa6ebc8b872d-Supplemental.pdf

Neural Information Processing SystemsOct-1-2025, 22:37:06 GMT

Long-term visual object tracking benchmark.

cvpr, dataset, detection, (17 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Sensing and Signal Processing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Leveraging GANs For Active Appearance Models Optimized Model Fitting

Awasthi, Anurag

arXiv.org Artificial IntelligenceJan-19-2025

Generative Adversarial Networks (GANs) have gained prominence in refining model fitting tasks in computer vision, particularly in domains involving deformable models like Active Appearance Models (AAMs). This paper explores the integration of GANs to enhance the AAM fitting process, addressing challenges in optimizing nonlinear parameters associated with appearance and shape variations. By leveraging GANs' adversarial training framework, the aim is to minimize fitting errors and improve convergence rates. Achieving robust performance even in cases with high appearance variability and occlusions. Our approach demonstrates significant improvements in accuracy and computational efficiency compared to traditional optimization techniques, thus establishing GANs as a potent tool for advanced image model fitting.

appearance model, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2501.11218

Country:

North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

Add feedback

One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering

Zhu, Yifan, Xiang, Tianyi, Dollar, Aaron, Pan, Zherong

arXiv.org Artificial IntelligenceDec-8-2024

Identifying predictive world models for robots in novel environments from sparse online observations is essential for robot task planning and execution in novel environments. However, existing methods that leverage differentiable simulators to identify world models are incapable of jointly optimizing the shape, appearance, and physical properties of the scene. In this work, we introduce a novel object representation that allows the joint identification of these properties. Our method employs a novel differentiable point-based object representation coupled with a grid-based appearance field, which allows differentiable object collision detection and rendering. Combined with a differentiable physical simulator, we achieve end-to-end optimization of world models, given the sparse visual and tactile observations of a physical motion sequence. Through a series of system identification tasks in simulated and real environments, we show that our method can learn both simulation- and rendering-ready world models from only one robot action sequence.

artificial intelligence, geometry, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2412.00259

Country:

North America > United States > Washington > King County > Bellevue (0.04)
North America > United States > Oregon (0.04)
North America > United States > Connecticut > New Haven County > New Haven (0.04)
North America > Canada > British Columbia (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking

Musunuru, Pratyusha, Li, Yuchao, Weber, Jamison, Bertsekas, Dimitri

arXiv.org Artificial IntelligenceMay-23-2024

In this work, we consider data association problems involving multi-object tracking (MOT). In particular, we address the challenges arising from object occlusions. We propose a framework called approximate dynamic programming track (ADPTrack), which applies dynamic programming principles to improve an existing method called the base heuristic. Given a set of tracks and the next target frame, the base heuristic extends the tracks by matching them to the objects of this target frame directly. In contrast, ADPTrack first processes a few subsequent frames and applies the base heuristic starting from the next target frame to obtain tentative tracks. It then leverages the tentative tracks to match the objects of the target frame. This tends to reduce the occlusion-based errors and leads to an improvement over the base heuristic. When tested on the MOT17 video dataset, the proposed method demonstrates a 0.7% improvement in the association accuracy (IDF1 metric) over a state-of-the-art method that is used as the base heuristic. It also obtains improvements with respect to all the other standard metrics. Empirically, we found that the improvements are particularly pronounced in scenarios where the video data is obtained by fixed-position cameras.

base heuristic, information, tracking, (17 more...)

arXiv.org Artificial Intelligence

2405.15137

Country:

North America > United States > Arizona (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A Generative Model for Parts-based Object Segmentation

Neural Information Processing SystemsMar-14-2024, 10:46:47 GMT

The Shape Boltzmann Machine (SBM) [1] has recently been introduced as a stateof-the-art model of foreground/background object shape. We extend the SBM to account for the foreground object's parts. Our new model, the Multinomial SBM (MSBM), can capture both local and global statistics of part shapes accurately. We combine the MSBM with an appearance model to form a fully generative model of images of objects. Parts-based object segmentations are obtained simply by performing probabilistic inference in the model. We apply the model to two challenging datasets which exhibit significant shape and appearance variability, and find that it obtains results that are comparable to the state-of-the-art. There has been significant focus in computer vision on object recognition and detection e.g.

Add feedback

Filters

Collaborating Authors

appearance model

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

06997f04a7db92466a2baa6ebc8b872d-Paper.pdf

06997f04a7db92466a2baa6ebc8b872d-Supplemental.pdf

a29f0fc2127c9d8cb1c9d86e423241af-Paper-Conference.pdf

DoDifferentTrackingTasksRequire DifferentAppearanceModels?

Revisiting Motion Information for RGB-Event Tracking with MOT Philosophy

06997f04a7db92466a2baa6ebc8b872d-Supplemental.pdf

Leveraging GANs For Active Appearance Models Optimized Model Fitting

One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering

An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking

A Generative Model for Parts-based Object Segmentation