Bergmann, Urs
Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers
Dax, Maximilian, Berbel, Jordi, Stria, Jan, Guibas, Leonidas, Bergmann, Urs
We generate abstractions of buildings, reflecting the essential aspects of their geometry and structure, by learning to invert procedural models. We first build a dataset of abstract procedural building models paired with simulated point clouds and then learn the inverse mapping through a transformer. Given a point cloud, the trained transformer then infers the corresponding abstracted building in terms of a programmatic language description. Training is fully supervised, based on a dataset of procedural buildings paired with corresponding point cloud simulations. We develop various technical components tailored to the generation of abstractions. This includes the design of a programmatic language to efficiently represent abstractions, its combination with a technique to guarantee transformer outputs consistent with the structure imposed by this language, and an …
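To make the grammar-consistency idea concrete, here is a minimal Python sketch of constrained decoding: tokens the language disallows after the current prefix are masked to -inf before the argmax, so every emitted program is well-formed by construction. The toy box/end grammar and the random stand-in for the trained point-cloud-conditioned transformer are hypothetical, not the paper's actual language or model.

```python
import numpy as np

# Toy grammar for a building program: each statement is "box" followed by
# three integer arguments; a program ends with "end". This is an
# illustrative stand-in for the paper's richer programmatic language.
TOKENS = ["box", "0", "1", "2", "3", "end"]
INT_TOKENS = {"0", "1", "2", "3"}

def valid_next_tokens(prefix):
    """Return the set of tokens the toy grammar allows next."""
    if not prefix or prefix[-1] == "end":
        return {"box"}                        # a statement must start
    last_box = len(prefix) - 1 - prefix[::-1].index("box")
    n_args = len(prefix) - 1 - last_box       # arguments since last "box"
    return INT_TOKENS if n_args < 3 else {"box", "end"}

def constrained_decode(logits_fn, max_len=12):
    """Greedy decoding with grammar masking: invalid tokens get -inf."""
    prefix = []
    for _ in range(max_len):
        logits = logits_fn(prefix)                       # (len(TOKENS),)
        mask = np.full(len(TOKENS), -np.inf)
        for i, t in enumerate(TOKENS):
            if t in valid_next_tokens(prefix):
                mask[i] = 0.0
        prefix.append(TOKENS[int(np.argmax(logits + mask))])
        if prefix[-1] == "end":
            break
    return prefix

# A random stand-in for the trained point-cloud-conditioned transformer.
rng = np.random.default_rng(0)
print(constrained_decode(lambda p: rng.normal(size=len(TOKENS))))
```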
Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
Sajjadi, Mehdi S. M., Meyer, Henning, Pot, Etienne, Bergmann, Urs, Greff, Klaus, Radwan, Noha, Vora, Suhani, Lucic, Mario, Duckworth, Daniel, Dosovitskiy, Alexey, Uszkoreit, Jakob, Funkhouser, Thomas, Tagliasacchi, Andrea
A classical problem in computer vision is to infer a 3D scene representation from a few images that can be used to render novel views at interactive rates. Previous work focuses on reconstructing pre-defined 3D representations, e.g. textured meshes, or implicit representations, e.g. radiance fields, and often requires input images with precise camera poses and long processing times for each novel scene. In this work, we propose the Scene Representation Transformer (SRT), a method which processes posed or unposed RGB images of a new area, infers a "set-latent scene representation", and synthesizes novel views, all in a single feed-forward pass. To calculate the scene representation, we propose a generalization of the Vision Transformer to sets of images, enabling global information integration, and hence 3D reasoning. An efficient decoder transformer parameterizes the light field by attending into the scene representation to render novel views. Learning is supervised end-to-end by minimizing a novel-view reconstruction error. We show that this method outperforms recent baselines in terms of PSNR and speed on synthetic datasets, including a new dataset created for the paper. Further, we demonstrate that SRT scales to support interactive visualization and semantic segmentation of real-world outdoor environments using Street View imagery.
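A minimal PyTorch sketch of the pattern described above: patch tokens from all input views are encoded once into a set-latent scene representation, and a lightweight decoder cross-attends into it with per-ray queries to predict colors. The module sizes, layer counts, and the 6-D origin-plus-direction ray encoding are illustrative assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class TinySRT(nn.Module):
    """Sketch of the SRT pattern: encode a set of image patches into a
    set-latent scene representation, then decode a color per query ray by
    cross-attending into that set."""
    def __init__(self, d=128, patch_dim=3 * 8 * 8):
        super().__init__()
        self.embed = nn.Linear(patch_dim, d)
        enc_layer = nn.TransformerEncoderLayer(d, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=2)
        self.ray_embed = nn.Linear(6, d)          # ray origin + direction
        self.cross_attn = nn.MultiheadAttention(d, num_heads=4,
                                                batch_first=True)
        self.to_rgb = nn.Sequential(nn.Linear(d, d), nn.ReLU(),
                                    nn.Linear(d, 3))

    def forward(self, patches, rays):
        # patches: (B, num_views * patches_per_view, patch_dim), all views
        # rays:    (B, num_rays, 6), rays of the novel view to render
        scene = self.encoder(self.embed(patches))  # set-latent scene repr.
        q = self.ray_embed(rays)
        h, _ = self.cross_attn(q, scene, scene)    # attend into the scene
        return self.to_rgb(h)                      # (B, num_rays, 3)

model = TinySRT()
colors = model(torch.randn(2, 5 * 16, 3 * 8 * 8), torch.randn(2, 64, 6))
print(colors.shape)  # torch.Size([2, 64, 3])
```

Training such a model would minimize a reconstruction error between predicted and ground-truth novel-view colors, matching the end-to-end supervision the abstract describes.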
Transform the Set: Memory Attentive Generation of Guided and Unguided Image Collages
Jetchev, Nikolay, Bergmann, Urs, Yildirim, Gökhan
Cutting and pasting image segments feels intuitive: the choice of source templates gives artists flexibility in recombining existing source material. Formally, this process takes an image set as input and outputs a collage of the set elements. Such selection from sets of source templates does not fit easily into classical convolutional neural models, which require inputs of fixed size. Inspired by advances in attention and set-input machine learning, we present a novel architecture that can generate image collages of source templates in one forward pass using set-structured representations. This paper has the following contributions: (i) a novel framework for image generation called Memory Attentive Generation of Image Collages (MAGIC), which gives artists new ways to create digital collages; (ii) from the machine-learning perspective, we show a novel Generative Adversarial Network (GAN) architecture that uses Set-Transformer layers and set-pooling to blend sets of random image samples - a hybrid non-parametric approach.
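The set-input blending idea can be sketched as attention over a variable-size set of templates: per-pixel queries score each template, a softmax over the set yields mixing weights, and the collage is the resulting convex combination. The tiny two-layer network below is a hypothetical stand-in; the paper trains such a generator adversarially with Set-Transformer layers and set-pooling.

```python
import torch
import torch.nn as nn

class TinyMAGIC(nn.Module):
    """Sketch of set-conditioned collage generation: per-pixel queries
    attend over a set of source templates, and the output is a convex
    (softmax-weighted) blend of the set elements at each pixel."""
    def __init__(self, d=32):
        super().__init__()
        self.query = nn.Conv2d(2, d, 1)   # queries from pixel coordinates
        self.key = nn.Conv2d(3, d, 1)     # keys from template colors

    def forward(self, templates):
        # templates: (N, 3, H, W) -- a set, so N may vary per call
        n, _, h, w = templates.shape
        ys, xs = torch.meshgrid(torch.linspace(-1, 1, h),
                                torch.linspace(-1, 1, w), indexing="ij")
        coords = torch.stack([ys, xs])[None]         # (1, 2, H, W)
        q = self.query(coords)                       # (1, d, H, W)
        k = self.key(templates)                      # (N, d, H, W)
        attn = torch.softmax((q * k).sum(1), dim=0)  # (N, H, W) weights
        return (attn[:, None] * templates).sum(0)    # (3, H, W) collage

collage = TinyMAGIC()(torch.rand(7, 3, 64, 64))
print(collage.shape)  # torch.Size([3, 64, 64])
```

Because the softmax runs over the set axis, the same module accepts any number of templates, which is exactly what fixed-input convolutional models cannot do.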
Set Flow: A Permutation Invariant Normalizing Flow
Rasul, Kashif, Schuster, Ingmar, Vollgraf, Roland, Bergmann, Urs
We present a generative model that is defined on finite sets of exchangeable, potentially high-dimensional, data. As the architecture is an extension of RealNVP, it inherits all of its favorable properties, such as being invertible and allowing for exact log-likelihood evaluation. We show that this architecture is able to learn finite non-i.i.d. set data distributions, to learn statistical dependencies between entities of the set, and to train and sample with variable set sizes in a computationally efficient manner. Experiments on 3D point clouds show state-of-the-art likelihoods.
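A minimal sketch of one permutation-equivariant coupling layer in PyTorch, assuming the usual RealNVP split of feature dimensions: conditioning the scale and shift on a mean-pooled set embedding lets elements of the set influence each other while RealNVP's exact log-determinant is preserved. Dimensions and the pooling choice are illustrative, not the paper's exact design.

```python
import torch
import torch.nn as nn

class SetCoupling(nn.Module):
    """One RealNVP-style coupling acting on a set of vectors. Conditioning
    on a mean-pooled set embedding keeps the transform permutation-
    equivariant while still modeling dependencies across set elements;
    the log-determinant stays exact, as in RealNVP."""
    def __init__(self, dim=4, hidden=64):
        super().__init__()
        self.half = dim // 2
        self.net = nn.Sequential(
            nn.Linear(2 * self.half, hidden), nn.ReLU(),
            nn.Linear(hidden, 2 * (dim - self.half)))

    def forward(self, x):
        # x: (batch, set_size, dim); works for any set_size
        a, b = x[..., :self.half], x[..., self.half:]
        pooled = a.mean(dim=1, keepdim=True).expand_as(a)  # set context
        s, t = self.net(torch.cat([a, pooled], -1)).chunk(2, -1)
        s = torch.tanh(s)                                  # stabilize scales
        y = torch.cat([a, b * torch.exp(s) + t], -1)
        log_det = s.sum(dim=(1, 2))                        # exact, per batch
        return y, log_det

y, log_det = SetCoupling()(torch.randn(8, 10, 4))
print(y.shape, log_det.shape)  # (8, 10, 4) (8,)
```

Inversion is straightforward because the untouched half `a` (and hence the pooled context) is available when undoing the affine map on `b`.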
Generating High-Resolution Fashion Model Images Wearing Custom Outfits
Yildirim, Gökhan, Jetchev, Nikolay, Vollgraf, Roland, Bergmann, Urs
Visualizing an outfit is an essential part of shopping for clothes. Due to the combinatorial nature of mixing fashion articles, the available images are limited to a pre-determined set of outfits. In this paper, we broaden these visualizations by generating high-resolution images of fashion models wearing a custom outfit under an input body pose. We show that our approach can not only transfer the style and the pose of one generated outfit to another, but can also create realistic images of human bodies and garments.
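One hedged way to picture the conditioning described above, as a sketch only: article embeddings are summed into an outfit code (so any combination of articles is representable), broadcast spatially, and concatenated with a body-pose keypoint heatmap before a small convolutional generator. Every size and layer here is a placeholder, not the paper's network.

```python
import torch
import torch.nn as nn

class OutfitPoseGenerator(nn.Module):
    """Illustrative sketch of conditioning an image generator on both a
    custom outfit (summed article embeddings) and a target body pose
    (a keypoint heatmap). Architecture and sizes are hypothetical."""
    def __init__(self, n_articles=1000, d=64, n_keypoints=17):
        super().__init__()
        self.article_emb = nn.Embedding(n_articles, d)
        self.net = nn.Sequential(
            nn.Conv2d(d + n_keypoints, 64, 3, padding=1), nn.ReLU(),
            nn.Upsample(scale_factor=2, mode="nearest"),
            nn.Conv2d(64, 3, 3, padding=1), nn.Tanh())

    def forward(self, article_ids, pose_heatmap):
        # article_ids: (B, n_outfit_items); pose_heatmap: (B, K, H, W)
        outfit = self.article_emb(article_ids).sum(1)   # (B, d) outfit code
        b, _, h, w = pose_heatmap.shape
        outfit_map = outfit[:, :, None, None].expand(b, -1, h, w)
        return self.net(torch.cat([outfit_map, pose_heatmap], 1))

img = OutfitPoseGenerator()(torch.randint(0, 1000, (2, 4)),
                            torch.rand(2, 17, 32, 32))
print(img.shape)  # torch.Size([2, 3, 64, 64])
```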
A Hierarchical Bayesian Model for Size Recommendation in Fashion
Guigourès, Romain, Ho, Yuen King, Koriagin, Evgenii, Sheikh, Abdul-Saboor, Bergmann, Urs, Shirvany, Reza
We introduce a hierarchical Bayesian approach to tackle the challenging problem of size recommendation in e-commerce fashion. Our approach jointly models a size purchased by a customer and its possible return event: 1. no return, 2. returned too small, 3. returned too big. These events are drawn from a multinomial distribution parameterized by the joint probability of each event, which is built from a hierarchy of combined priors. Such a model allows us to incorporate extended domain expertise and article characteristics as prior knowledge, which in turn makes it possible for the underlying parameters to emerge once sufficient data is available. Experiments are presented on real (anonymized) data from millions of customers, along with a detailed discussion of the efficiency of such an approach within a large-scale production system.
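A compact worked example of the multinomial return-event model, substituting a single conjugate Dirichlet prior for the paper's full hierarchy of priors: posterior event probabilities are simple pseudo-count updates, and the recommended size is the one maximizing the posterior probability of no return. The pseudo-counts and purchase data below are made up.

```python
import numpy as np

# Events per the paper's model: index 0 = kept (no return),
# 1 = returned too small, 2 = returned too big. A Dirichlet prior over
# the multinomial event probabilities is a standard conjugate choice and
# stands in here for the paper's full hierarchy of priors.
prior = np.array([4.0, 1.0, 1.0])   # hypothetical prior pseudo-counts

# Observed outcomes for one article, per offered size (hypothetical data).
counts = {
    "S": np.array([2, 9, 0]),       # mostly "too small" returns
    "M": np.array([12, 2, 1]),
    "L": np.array([5, 0, 6]),       # often "too big"
}

for size, c in counts.items():
    posterior_mean = (prior + c) / (prior + c).sum()  # conjugate update
    print(size, "P(no return) =", round(posterior_mean[0], 3))
# Recommend the size that maximizes the posterior P(no return): here "M".
```

With few observations the prior dominates (the domain-expertise role the abstract mentions); as counts accumulate, the data-driven parameters take over.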
A Deep Learning System for Predicting Size and Fit in Fashion E-Commerce
Sheikh, Abdul-Saboor, Guigourès, Romain, Koriagin, Evgenii, Ho, Yuen King, Shirvany, Reza, Vollgraf, Roland, Bergmann, Urs
Personalized size and fit recommendations bear crucial significance for any fashion e-commerce platform. Predicting the correct fit drives customer satisfaction and benefits the business by reducing costs incurred due to size-related returns. Traditional collaborative filtering algorithms seek to model customer preferences based on their previous orders. A typical challenge for such methods stems from extreme sparsity of customer-article orders. To alleviate this problem, we propose a deep learning based content-collaborative methodology for personalized size and fit recommendation. Our proposed method can ingest arbitrary customer and article data and can model multiple individuals or intents behind a single account. The method optimizes a global set of parameters to learn population-level abstractions of size and fit relevant information from observed customer-article interactions. It further employs customer and article specific embedding variables to learn their properties. Together with learned entity embeddings, the method maps additional customer and article attributes into a latent space to derive personalized recommendations. Application of our method to two publicly available datasets demonstrates an improvement over the state-of-the-art published results. On two proprietary datasets, one containing fit feedback from fashion experts and the other involving customer purchases, we further outperform comparable methodologies, including a recent Bayesian approach for size recommendation.
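A minimal sketch of the content-collaborative setup in PyTorch: learned customer and article embeddings are concatenated with arbitrary side attributes and mapped to logits over three fit outcomes (too small / good fit / too big). Entity counts, attribute vectors, and layer sizes are placeholders, not the paper's model.

```python
import torch
import torch.nn as nn

class SizeFitNet(nn.Module):
    """Sketch of a content-collaborative fit model: entity embeddings
    capture customer- and article-specific properties, side attributes
    supply content signal, and a shared head learns population-level
    size-and-fit abstractions."""
    def __init__(self, n_customers, n_articles, attr_dim, d=32):
        super().__init__()
        self.customer = nn.Embedding(n_customers, d)
        self.article = nn.Embedding(n_articles, d)
        self.head = nn.Sequential(
            nn.Linear(2 * d + attr_dim, 64), nn.ReLU(),
            nn.Linear(64, 3))                  # logits over fit outcomes

    def forward(self, customer_id, article_id, attrs):
        h = torch.cat([self.customer(customer_id),
                       self.article(article_id), attrs], dim=-1)
        return self.head(h)

model = SizeFitNet(n_customers=100, n_articles=50, attr_dim=8)
logits = model(torch.tensor([3]), torch.tensor([7]), torch.randn(1, 8))
print(logits.softmax(-1))  # predicted fit distribution
# Training would minimize cross-entropy against observed return events.
```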
A Bandit Framework for Optimal Selection of Reinforcement Learning Agents
Merentitis, Andreas, Rasul, Kashif, Vollgraf, Roland, Sheikh, Abdul-Saboor, Bergmann, Urs
Deep Reinforcement Learning has been shown to be very successful in complex games, e.g. Atari or Go. These games have clearly defined rules, and hence allow simulation. In many practical applications, however, interactions with the environment are costly and a good simulator of the environment is not available. Further, as environments differ by application, the optimal inductive bias (architecture, hyperparameters, etc.) of a reinforcement learning agent depends on the application. In this work, we propose a multi-armed bandit framework that selects from a set of different reinforcement learning agents to choose the one with the best inductive bias. To alleviate the problem of sparse rewards, the reinforcement learning agents are augmented with surrogate rewards. This helps the bandit framework to select the best agents early, since these rewards are smoother and less sparse than the environment reward. The bandit has the double objective of maximizing the reward while the agents are learning and selecting the best agent after a finite number of learning steps. Our experimental results on standard environments show that the proposed framework is able to consistently select the optimal agent after a finite number of steps, while collecting more cumulative reward compared to selecting a sub-optimal architecture or uniformly alternating between different agents.
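The selection loop can be illustrated with the classic UCB1 rule over a set of agents. The agents are reduced to noisy scalar-reward stubs here, and the low-variance noise only mimics the smoothing effect of the paper's surrogate rewards, so this shows the bandit mechanics rather than the paper's exact objective.

```python
import numpy as np

def ucb1_select(total_reward, pulls, t, c=2.0):
    """UCB1 index: exploit mean reward, explore rarely-tried agents."""
    if (pulls == 0).any():
        return int(np.argmax(pulls == 0))     # try each agent once first
    means = total_reward / pulls
    bonus = np.sqrt(c * np.log(t + 1) / pulls)
    return int(np.argmax(means + bonus))

rng = np.random.default_rng(1)
n_agents = 3
true_quality = np.array([0.2, 0.5, 0.8])   # hypothetical agent qualities
total_reward = np.zeros(n_agents)
pulls = np.zeros(n_agents)

for t in range(500):
    a = ucb1_select(total_reward, pulls, t)
    # Stand-in for "run agent a for one episode and observe a surrogate
    # reward" -- smoother and denser than the raw environment reward.
    r = true_quality[a] + 0.1 * rng.normal()
    total_reward[a] += r
    pulls[a] += 1

print("most-selected agent:", int(np.argmax(pulls)))  # expect agent 2
```

Smoother surrogate rewards shrink the noise term, which is precisely what lets a bandit like this commit to the best agent after fewer learning steps.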
Copy the Old or Paint Anew? An Adversarial Framework for (non-) Parametric Image Stylization
Jetchev, Nikolay, Bergmann, Urs, Yildirim, Gökhan
Parametric generative deep models are state-of-the-art for photo and non-photo realistic image stylization. However, learning complicated image representations requires compute-intense models parametrized by a huge number of weights, which in turn requires large datasets to make learning successful. Non-parametric exemplar-based generation is a technique that works well to reproduce style from small datasets, but is also compute-intensive. These aspects are a drawback for the practice of digital AI artists: typically one wants to use a small set of stylization images, and needs a fast flexible model in order to experiment with it. With this motivation, our work has these contributions: (i) a novel stylization method called Fully Adversarial Mosaics (FAMOS) that combines the strengths of both parametric and non-parametric approaches; (ii) multiple ablations and image examples that analyze the method and show its capabilities; (iii) source code that will empower artists and machine learning researchers to use and modify FAMOS.
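The hybrid idea can be sketched with three ingredients: a parametric branch that paints pixels, a non-parametric branch that copies a softmax-weighted blend from a memory of templates, and a learned per-pixel mask that mixes the two. The layers below are illustrative stand-ins; FAMOS itself is trained adversarially.

```python
import torch
import torch.nn as nn

class TinyFAMOS(nn.Module):
    """Sketch of the hybrid idea behind FAMOS: a parametric branch paints
    pixels directly, a non-parametric branch copies from a memory of
    template images, and a learned per-pixel mask mixes the two."""
    def __init__(self, n_templates=4):
        super().__init__()
        self.paint = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 3, 3, padding=1), nn.Tanh())
        # logits: which template to copy at each pixel, plus a mix mask
        self.select = nn.Conv2d(3, n_templates + 1, 3, padding=1)

    def forward(self, content, templates):
        # content: (B, 3, H, W); templates: (B, N, 3, H, W) memory bank
        out = self.select(content)
        copy_w = torch.softmax(out[:, :-1], dim=1)         # (B, N, H, W)
        copied = (copy_w[:, :, None] * templates).sum(1)   # non-parametric
        painted = self.paint(content)                      # parametric
        alpha = torch.sigmoid(out[:, -1:])                 # per-pixel mix
        return alpha * painted + (1 - alpha) * copied

x = TinyFAMOS()(torch.rand(1, 3, 64, 64), torch.rand(1, 4, 3, 64, 64))
print(x.shape)  # torch.Size([1, 3, 64, 64])
```

Copying from the memory reproduces style faithfully from small template sets, while the painted residual covers regions no template explains well.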
Disentangling Multiple Conditional Inputs in GANs
Yildirim, Gökhan, Seward, Calvin, Bergmann, Urs
In this paper, we propose a method that disentangles the effects of multiple input conditions in Generative Adversarial Networks (GANs). In particular, we demonstrate our method in controlling color, texture, and shape of a generated garment image for computer-aided fashion design. To disentangle the effect of input attributes, we customize conditional GANs with consistency loss functions. In our experiments, we tune one input at a time and show that we can guide our network to generate novel and realistic images of clothing articles. In addition, we present a fashion design process that estimates the input attributes of an existing garment and modifies them using our generator.
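A sketch of the consistency-loss idea, with stand-in networks: an auxiliary encoder re-estimates the color, texture, and shape codes from the generated image, and the generator is penalized when any recovered attribute drifts from its input condition, which is what lets one input be tuned at a time. In a full setup this term would be added to the usual adversarial loss.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical stand-in networks, not the paper's architecture.
G = nn.Sequential(nn.Linear(3 * 8, 64), nn.ReLU(),
                  nn.Linear(64, 3 * 32 * 32), nn.Tanh())    # generator
E = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 64),
                  nn.ReLU(), nn.Linear(64, 3 * 8))          # attr. encoder

color, texture, shape = (torch.randn(4, 8), torch.randn(4, 8),
                         torch.randn(4, 8))
cond = torch.cat([color, texture, shape], dim=-1)           # (4, 24)
fake = G(cond).view(4, 3, 32, 32)

# Consistency: each recovered attribute must match its input condition,
# so tuning e.g. only "color" leaves texture and shape unchanged.
c_hat, t_hat, s_hat = E(fake).chunk(3, dim=-1)
loss_consistency = (F.mse_loss(c_hat, color)
                    + F.mse_loss(t_hat, texture)
                    + F.mse_loss(s_hat, shape))
print(loss_consistency.item())
```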