Knop, Szymon
LocoGAN -- Locally Convolutional GAN
Struski, Łukasz, Knop, Szymon, Tabor, Jacek, Daniec, Wiktor, Spurek, Przemysław
In the paper we construct a fully convolutional GAN model: LocoGAN, whose latent space is given by noise-like images of possibly different resolutions. The learning is local, i.e. we process not the whole noise-like image, but sub-images of a fixed size. We add extra channels with spatial information to the input noise images. This architecture and the design of the latent space allow us to use an input of various dimensions. We use that to train our model only on parts of the latent image, see Figure 1. We call this approach local learning. Section 3 contains the detailed description of this approach.
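A minimal PyTorch sketch of such a noise-like latent is given below; it illustrates the description above and is not code from the paper: the number of channels, the two-channel coordinate encoding, and the patch size are assumed values.

```python
import torch

def make_latent(noise_channels=3, height=12, width=12):
    """Noise-like latent image with two extra channels carrying spatial coordinates."""
    noise = torch.randn(noise_channels, height, width)
    ys = torch.linspace(-1.0, 1.0, height)
    xs = torch.linspace(-1.0, 1.0, width)
    yy, xx = torch.meshgrid(ys, xs, indexing="ij")
    return torch.cat([noise, yy.unsqueeze(0), xx.unsqueeze(0)], dim=0)

def random_subimage(latent, size=4):
    """Crop a fixed-size sub-image of the latent; only this part is processed in one step."""
    _, h, w = latent.shape
    top = torch.randint(0, h - size + 1, (1,)).item()
    left = torch.randint(0, w - size + 1, (1,)).item()
    return latent[:, top:top + size, left:left + size]

z = make_latent()              # full noise-like latent, shape (5, 12, 12)
patch = random_subimage(z, 4)  # local training input, shape (5, 4, 4)
```

Because the generator is fully convolutional, a latent of a different height and width can be fed through the same network, which is what allows inputs of various dimensions.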
Target Layer Regularization for Continual Learning Using Cramer-Wold Generator
Mazur, Marcin, Pustelnik, Łukasz, Knop, Szymon, Pagacz, Patryk, Spurek, Przemysław
The concept of continual learning (CL), which aims to reduce the distance between human and artificial intelligence, has recently been considered by the deep learning community as one of its main challenges. Generally speaking, it means the ability of a neural network to effectively learn consecutive tasks (in either supervised or unsupervised scenarios) while trying to prevent forgetting of already learned information. Therefore, when designing an appropriate strategy, it needs to be ensured that the network weights are updated in such a way that they correspond to both the current and all previous tasks. In practice, however, it is quite likely that a constructed CL model will suffer from either intransigence (difficulty in acquiring new knowledge, see Chaudhry et al. [2018]) or the catastrophic forgetting (CF) phenomenon (a tendency to lose past knowledge, see McCloskey and Cohen [1989]). In recent years, methods of overcoming the above-mentioned problems have been the subject of wide and intensive investigation.
Generative models with kernel distance in data space
Knop, Szymon, Mazur, Marcin, Spurek, Przemysław, Tabor, Jacek, Podolak, Igor
Generative models dealing with modeling a joint data distribution are generally either autoencoder or GAN based. Both have their pros and cons: the former tend to generate blurry images, while the latter can be unstable in training and prone to the mode collapse phenomenon. The objective of this paper is to construct a model situated between the above architectures, one that does not inherit their main weaknesses. The proposed LCW generator (Latent Cramer-Wold generator) resembles a classical GAN in transforming Gaussian noise into data space. What is of utmost importance, instead of a discriminator, the LCW generator uses a kernel distance. No adversarial training is utilized, hence the name generator. It is trained in two phases. First, an autoencoder based architecture, using kernel measures, is built to model a manifold of data. Then we propose a Latent Trick, mapping a Gaussian to the latent space, in order to obtain the final model. This results in very competitive FID values.
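The sketch below illustrates what a kernel distance computed directly in data space can look like; it uses a plain Gaussian-kernel (MMD-style) distance between generated and real batches as a stand-in, which is an assumption rather than the paper's Cramer-Wold formulation.

```python
import torch

def gaussian_kernel(a, b, sigma=1.0):
    """Pairwise Gaussian kernel values between rows of a and b."""
    d2 = torch.cdist(a, b).pow(2)
    return torch.exp(-d2 / (2.0 * sigma ** 2))

def kernel_distance(generated, real, sigma=1.0):
    """Squared MMD-style kernel distance between two samples in data space."""
    k_gg = gaussian_kernel(generated, generated, sigma).mean()
    k_rr = gaussian_kernel(real, real, sigma).mean()
    k_gr = gaussian_kernel(generated, real, sigma).mean()
    return k_gg + k_rr - 2.0 * k_gr

# usage: flatten images to vectors before comparing
fake = torch.randn(64, 784)   # e.g. generator(noise).view(64, -1)
data = torch.randn(64, 784)
loss = kernel_distance(fake, data)
```

Since no discriminator is trained, the generator is optimized directly against this sample-based distance.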
One-element Batch Training by Moving Window
Spurek, Przemysław, Knop, Szymon, Tabor, Jacek, Podolak, Igor, Wójcik, Bartosz
Several deep models, especially generative ones, compare samples from two distributions (e.g. WAE-like AutoEncoder models, set-processing deep networks, etc.) in their cost functions. With all these methods one cannot train the model directly on small (in the extreme, one-element) batches, due to the fact that samples have to be compared. We propose a generic approach to training such models using one-element mini-batches. The idea is based on splitting the batch in latent space into parts: previous, i.e. historical, elements used for latent space distribution matching, and the current ones, used both for the latent distribution computation and the minimization process. Due to the smaller memory requirements, this allows training networks on higher resolution images than in the classical approach.
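A rough PyTorch sketch of this moving-window scheme follows; the encoder, decoder, and the simple moment-matching latent cost are hypothetical stand-ins for the actual model, used only to show how historical latent codes are combined with the current one-element batch.

```python
import torch
from collections import deque

window = deque(maxlen=256)  # historical latent codes, stored without gradients

def latent_matching_loss(z_all):
    """Toy stand-in for a latent distribution-matching cost (match moments of N(0, I))."""
    return z_all.mean(dim=0).pow(2).mean() + (z_all.var(dim=0) - 1.0).pow(2).mean()

def train_step(encoder, decoder, optimizer, x):
    """One training step on a one-element batch x of shape (1, ...)."""
    optimizer.zero_grad()
    z = encoder(x)                                  # current latent, carries gradients
    past = list(window)
    z_all = torch.cat([z] + past, dim=0) if past else z
    loss = (decoder(z) - x).pow(2).mean()           # reconstruction on the current element
    if z_all.shape[0] > 1:                          # distribution matching needs history
        loss = loss + latent_matching_loss(z_all)
    loss.backward()                                 # gradients flow only through z
    optimizer.step()
    window.append(z.detach())                       # keep the current code as history
    return loss.item()
```

Only the current element is encoded, decoded, and backpropagated through, which is why the memory footprint stays small even for high-resolution inputs.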
Sliced generative models
Knop, Szymon, Mazur, Marcin, Tabor, Jacek, Podolak, Igor, Spurek, Przemysław
In this paper we discuss a class of AutoEncoder based generative models built on a one-dimensional sliced approach. The idea is based on reducing the discrimination between samples to the one-dimensional case. Our experiments show that the methods can be divided into two groups. The first consists of methods which are modifications of standard normality tests, while the second is based on classical distances between samples. It turns out that both groups yield correct generative models, but the second one gives a slightly faster decrease rate of the Fréchet Inception Distance (FID).
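As an illustration of the second group, the sketch below computes a classical distance on one-dimensional projections (an averaged sorted-projection, sliced Wasserstein-style distance); treating this particular distance as a representative member of that group is an assumption.

```python
import torch

def sliced_distance(a, b, n_projections=50):
    """Average 1-D sorted-projection distance over random unit directions."""
    dim = a.shape[1]
    directions = torch.randn(n_projections, dim)
    directions = directions / directions.norm(dim=1, keepdim=True)
    pa = a @ directions.t()                  # shape (n_a, n_projections)
    pb = b @ directions.t()
    pa_sorted, _ = torch.sort(pa, dim=0)     # sorting gives the 1-D optimal coupling
    pb_sorted, _ = torch.sort(pb, dim=0)
    return (pa_sorted - pb_sorted).pow(2).mean()

x = torch.randn(128, 16)                     # e.g. an encoded batch
z = torch.randn(128, 16)                     # a sample from the prior
print(sliced_distance(x, z))
```

Methods from the first group would instead apply a one-dimensional normality test statistic to each projected sample.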
Cramer-Wold AutoEncoder
Tabor, Jacek, Knop, Szymon, Spurek, Przemysław, Podolak, Igor, Mazur, Marcin, Jastrzębski, Stanisław
We propose a new generative model, the Cramer-Wold Autoencoder (CWAE). Following WAE, we directly encourage normality of the latent space. Our paper also uses the recent idea from the Sliced WAE (SWAE) model, which uses one-dimensional projections as a method of verifying the closeness of two distributions. The crucial new ingredient is the introduction of a new (Cramer-Wold) metric in the space of densities, which replaces the Wasserstein metric used in SWAE. We show that the Cramer-Wold metric between Gaussian mixtures is given by a simple analytic formula, which removes the sampling necessary to estimate the cost function in the WAE and SWAE models. As a consequence, while drastically simplifying the optimization procedure, CWAE produces samples of perceptual quality matching other SOTA models.
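The sketch below shows a closed-form squared Cramer-Wold distance between a latent sample and N(0, I), written from the definition of the metric rather than copied from the paper; the asymptotic form of phi_D and the Silverman-style choice of gamma are assumptions and should be checked against the paper.

```python
import math
import torch

def phi_d(s, d):
    """Asymptotic approximation of 1F1(1/2; d/2; -s), intended for large d."""
    return (1.0 + 4.0 * s / (2.0 * d - 3.0)).rsqrt()

def cw_distance_to_normal(z):
    """Closed-form squared Cramer-Wold distance between sample z (n x d) and N(0, I)."""
    n, d = z.shape
    gamma = (4.0 / (3.0 * n)) ** 0.4        # Silverman-style bandwidth (assumed choice)
    pair = torch.cdist(z, z).pow(2)
    term_zz = phi_d(pair / (4.0 * gamma), d).mean() / (2.0 * math.sqrt(math.pi * gamma))
    norms = z.pow(2).sum(dim=1)
    term_zn = phi_d(norms / (4.0 * gamma + 2.0), d).mean() / math.sqrt(math.pi * (4.0 * gamma + 2.0))
    term_nn = 1.0 / (2.0 * math.sqrt(math.pi * (gamma + 1.0)))
    return term_zz - 2.0 * term_zn + term_nn

z = torch.randn(256, 64)                    # e.g. an encoded mini-batch
print(cw_distance_to_normal(z))
```

Because the whole expression is analytic in the encoded batch, no random projections or sampled test points are needed to estimate the latent regularization term.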