AITopics | Safin, Aleksandr

Collaborating Authors

Safin, Aleksandr

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

RePAST: Relative Pose Attention Scene Representation Transformer

Safin, Aleksandr, Duckworth, Daniel, Sajjadi, Mehdi S. M.

arXiv.org Artificial IntelligenceApr-10-2023

The Scene Representation Transformer (SRT) is a recent method to render novel views at interactive rates. Since SRT uses camera poses with respect to an arbitrarily chosen reference camera, it is not invariant to the order of the input views. As a result, SRT is not directly applicable to large-scale scenes where the reference frame would need to be changed regularly. In this work, we propose Relative Pose Attention SRT (RePAST): Instead of fixing a reference frame at the input, we inject pairwise relative camera pose information directly into the attention mechanism of the Transformers. This leads to a model that is by definition invariant to the choice of any global reference frame, while still retaining the full capabilities of the original method. Empirical results show that adding this invariance to the model does not lead to a loss in quality. We believe that this is a step towards applying fully latent transformer-based rendering methods to large-scale scenes.

artificial intelligence, machine learning, srt, (14 more...)

arXiv.org Artificial Intelligence

2304.00947

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Multi-sensor large-scale dataset for multi-view 3D reconstruction

Voynov, Oleg, Bobrovskikh, Gleb, Karpyshev, Pavel, Galochkin, Saveliy, Ardelean, Andrei-Timotei, Bozhenko, Arseniy, Karmanova, Ekaterina, Kopanev, Pavel, Labutin-Rymsho, Yaroslav, Rakhimov, Ruslan, Safin, Aleksandr, Serpiva, Valerii, Artemov, Alexey, Burnaev, Evgeny, Tsetserukou, Dzmitry, Zorin, Denis

arXiv.org Artificial IntelligenceMar-28-2023

We present a new multi-sensor dataset for multi-view 3D surface reconstruction. It includes registered RGB and depth data from sensors of different resolutions and modalities: smartphones, Intel RealSense, Microsoft Kinect, industrial cameras, and structured-light scanner. The scenes are selected to emphasize a diverse set of material properties challenging for existing algorithms. We provide around 1.4 million images of 107 different scenes acquired from 100 viewing directions under 14 lighting conditions. We expect our dataset will be useful for evaluation and training of 3D reconstruction algorithms and for related tasks. The dataset is available at skoltech3d.appliedai.tech.

artificial intelligence, dataset, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2203.06111

Country: North America > United States > California (0.28)

Genre: Research Report (0.50)

Industry:

Leisure & Entertainment > Sports (0.46)
Leisure & Entertainment > Games (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback