AITopics | Tabor, Jacek

Collaborating Authors

Tabor, Jacek

SONG: Self-Organizing Neural Graphs

Struski, Łukasz, Danel, Tomasz, Śmieja, Marek, Tabor, Jacek, Zieliński, Bartosz

arXiv.org Artificial Intelligence, Jul-28-2021

Recent years have seen a surge in research on deep interpretable neural networks, with decision trees as one of the most commonly incorporated tools. There are at least three advantages of using decision trees over logistic regression classification models: they are easy to interpret since they are based on binary decisions, they can make decisions faster, and they provide a hierarchy of classes. However, one well-known drawback of decision trees, as compared to decision graphs, is that decision trees cannot reuse decision nodes. Nevertheless, decision graphs have not been commonly used in deep learning due to the lack of efficient gradient-based training techniques. In this paper, we fill this gap and provide a general paradigm based on Markov processes, which allows for efficient training of a special type of decision graph that we call Self-Organizing Neural Graphs (SONG). We provide an extensive theoretical study of SONG, complemented by experiments conducted on the Letter, Connect4, MNIST, CIFAR, and TinyImageNet datasets, showing that our method performs on par with or better than existing decision models.
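As a rough illustration of the Markov-process view mentioned in this abstract, a decision graph can be read as a Markov chain in which internal nodes pass probability mass to their successors and leaves absorb it. This is a minimal sketch, not the authors' implementation: the transition matrix `P`, the node layout, and the function name `propagate` are hypothetical, and the probabilities are fixed numbers here, whereas a trained model would presumably produce them from the input.

```python
def propagate(transition, start, steps):
    """Push a probability distribution through a row-stochastic matrix."""
    dist = list(start)
    n = len(dist)
    for _ in range(steps):
        # New mass at node j is the mass arriving from every node i.
        dist = [sum(dist[i] * transition[i][j] for i in range(n)) for j in range(n)]
    return dist


# Hypothetical graph: nodes 0 and 1 are internal decision nodes,
# nodes 2 and 3 are leaves (classes) that loop back to themselves.
P = [
    [0.0, 0.6, 0.4, 0.0],  # node 0 moves to node 1 or to leaf 2
    [0.0, 0.0, 0.5, 0.5],  # node 1 moves to leaf 2 or to leaf 3
    [0.0, 0.0, 1.0, 0.0],  # leaf 2 is absorbing
    [0.0, 0.0, 0.0, 1.0],  # leaf 3 is absorbing
]

# Start at the root and take two steps; the last two entries are class probabilities.
leaf_probs = propagate(P, [1.0, 0.0, 0.0, 0.0], steps=2)[2:]
```

Leaf 2 is reachable from both internal nodes, which is the kind of node reuse that a tree cannot express.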

decision tree learning, deep learning, probability, (18 more...)

arXiv.org Artificial Intelligence

2107.13214

Country: Europe > Poland (0.15)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

RegFlow: Probabilistic Flow-based Regression for Future Prediction

Zięba, Maciej, Przewięźlikowski, Marcin, Śmieja, Marek, Tabor, Jacek, Trzcinski, Tomasz, Spurek, Przemysław

arXiv.org Machine Learning, Nov-30-2020

Predicting future states or actions of a given system remains a fundamental yet unsolved challenge of intelligence, especially in complex and non-deterministic scenarios such as modeling human behavior. Existing approaches provide results under strong assumptions concerning the unimodality of future states or, at best, assume specific probability distributions that often fit real-life conditions poorly. In this work, we introduce a robust and flexible probabilistic framework that makes it possible to model future predictions with virtually no constraints regarding the modality or the underlying probability distribution. To achieve this goal, we leverage a hypernetwork architecture and train a continuous normalizing flow model. The resulting method, dubbed RegFlow, achieves state-of-the-art results on several benchmark datasets, outperforming competing approaches by a significant margin.

deep learning, neural network, prediction, (20 more...)

arXiv.org Machine Learning

2011.1462

Country: Europe > Poland (0.48)

Genre: Research Report (0.82)

Industry: Transportation > Ground > Road (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Sensing and Signal Processing (0.93)
(4 more...)

ProtoPShare: Prototype Sharing for Interpretable Image Classification and Similarity Discovery

Rymarczyk, Dawid, Struski, Łukasz, Tabor, Jacek, Zieliński, Bartosz

arXiv.org Artificial Intelligence, Nov-29-2020

In this paper, we introduce ProtoPShare, a self-explained method that incorporates the paradigm of prototypical parts to explain its predictions. The main novelty of ProtoPShare is its ability to efficiently share prototypical parts between classes thanks to our data-dependent merge-pruning. Moreover, the prototypes are more consistent and the model is more robust to image perturbations than the state-of-the-art method ProtoPNet. We verify our findings on two datasets, CUB-200-2011 and Stanford Cars.

artificial intelligence, neural network, prototype, (19 more...)

arXiv.org Artificial Intelligence

2011.1434

Country:

Europe > Poland (0.14)
Oceania > Australia (0.14)

Genre:

Research Report > Promising Solution (0.34)
Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.83)

Flow-based anomaly detection

Maziarka, Łukasz, Śmieja, Marek, Sendera, Marcin, Struski, Łukasz, Tabor, Jacek, Spurek, Przemysław

arXiv.org Machine Learning, Oct-6-2020

We propose OneFlow, a flow-based one-class classifier for anomaly (outlier) detection that finds a minimal-volume bounding region. Contrary to density-based methods, OneFlow is constructed in such a way that its result typically does not depend on the structure of outliers. This is because, during training, the gradient of the cost function is propagated only over the points located near the decision boundary (behavior similar to that of support vectors in SVM). The combination of flow models and a Bernstein quantile estimator allows OneFlow to find a parametric form of the bounding region, which can be useful in various applications, including describing shapes from 3D point clouds. Experiments show that the proposed model outperforms related methods on real-world anomaly detection problems.
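For readers unfamiliar with the Bernstein quantile estimator named in this abstract, the sketch below shows one standard form of it: a smooth quantile estimate built as a binomially weighted average of the sorted samples. It is only an assumption that this is the variant the authors use, the abstract does not say how the estimator is combined with the flow, and the function name `bernstein_quantile` is hypothetical.

```python
from math import comb


def bernstein_quantile(samples, p):
    """Bernstein polynomial estimate of the p-th quantile, with 0 <= p <= 1."""
    xs = sorted(samples)
    n = len(xs)
    # The k-th order statistic gets the Binomial(n - 1, p) probability of k.
    return sum(
        comb(n - 1, k) * p**k * (1 - p) ** (n - 1 - k) * xs[k]
        for k in range(n)
    )


# Unlike the empirical quantile, the estimate varies smoothly with p.
median_estimate = bernstein_quantile([5, 1, 3, 2, 4], 0.5)
```

Because every sorted sample enters the estimate with a weight that is a polynomial in `p`, the result is differentiable in the samples, which is one plausible reason to prefer it to a hard order statistic in gradient-based training.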

dataset, deep learning, neural network, (18 more...)

arXiv.org Machine Learning

2010.03002

Genre: Research Report (0.93)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Generative models with kernel distance in data space

Knop, Szymon, Mazur, Marcin, Spurek, Przemysław, Tabor, Jacek, Podolak, Igor

arXiv.org Machine Learning, Sep-15-2020

Generative models of a joint data distribution are generally either autoencoder-based or GAN-based. Both have their pros and cons: the former tend to generate blurry images, while the latter are unstable in training and prone to mode collapse. The objective of this paper is to construct a model situated between the above architectures, one that does not inherit their main weaknesses. The proposed LCW generator (Latent Cramer-Wold generator) resembles a classical GAN in transforming Gaussian noise into data space. Most importantly, instead of a discriminator, the LCW generator uses a kernel distance. No adversarial training is utilized, hence the name generator. It is trained in two phases. First, an autoencoder-based architecture, using kernel measures, is built to model a manifold of data. We then propose a Latent Trick, mapping a Gaussian to the latent space, in order to get the final model. This results in very competitive FID values.

artificial intelligence, generator, neural network, (18 more...)

arXiv.org Machine Learning

2009.07327

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

Wójcik, Bartosz, Morawiecki, Paweł, Śmieja, Marek, Krzyżek, Tomasz, Spurek, Przemysław, Tabor, Jacek

arXiv.org Machine Learning, Jun-17-2020

We present a mechanism for detecting adversarial examples based on data representations taken from the hidden layers of the target network. For this purpose, we train individual autoencoders at intermediate layers of the target network. This allows us to describe the manifold of true data and, in consequence, decide whether a given example has the same characteristics as true data. It also gives us insight into the behavior of adversarial examples and their flow through the layers of a deep neural network. Experimental results show that our method outperforms the state of the art in supervised and unsupervised settings.

adversarial example, deep learning, neural network, (19 more...)

arXiv.org Machine Learning

2006.10013

Country: North America > Canada (0.46)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Kernel Self-Attention in Deep Multiple Instance Learning

Rymarczyk, Dawid, Tabor, Jacek, Zieliński, Bartosz

arXiv.org Machine Learning, May-25-2020

Multiple Instance Learning (MIL) is a form of weakly supervised learning in which only one label is provided for an entire bag of instances. As such, it appears in many problems of medical image analysis, such as the classification of whole-slide biopsy images. Most recently, MIL was also applied to deep architectures by introducing an aggregation operator that focuses on the crucial instances of a bag. In this paper, we enrich this idea with the self-attention mechanism to take into account dependencies across the instances. We conduct several experiments and show that our method with various types of kernels increases the accuracy, especially in the case of non-standard MIL assumptions. This is of importance for real-world medical problems, which usually satisfy presence-based or threshold-based assumptions.
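The two ingredients named in this abstract, self-attention across the instances of a bag followed by an attention-based aggregation operator, can be sketched as follows. This is a simplified illustration rather than the authors' model: it uses a plain scaled dot product in place of the kernels the paper studies, it has no learned projections, and `scorer` is a hypothetical stand-in for learned attention parameters.

```python
import math


def softmax(scores):
    m = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def self_attention(bag):
    """Replace each instance by a similarity-weighted average of all instances."""
    scale = math.sqrt(len(bag[0]))
    out = []
    for query in bag:
        weights = softmax([dot(query, key) / scale for key in bag])
        out.append([sum(w * inst[d] for w, inst in zip(weights, bag))
                    for d in range(len(query))])
    return out


def attention_pool(bag, scorer):
    """Aggregate a bag into one vector using softmax weights over instance scores."""
    weights = softmax([dot(scorer, inst) for inst in bag])
    pooled = [sum(w * inst[d] for w, inst in zip(weights, bag))
              for d in range(len(bag[0]))]
    return pooled, weights


bag = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three instances, two features each
scorer = [2.0, 0.0]                          # hypothetical attention parameters
pooled, weights = attention_pool(self_attention(bag), scorer)
```

The pooled vector would then be passed to a bag-level classifier, and `weights` indicates which instances contributed most to the bag representation.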

assumption, deep learning, neural network, (19 more...)

arXiv.org Machine Learning

2005.12991

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Diagnostic Medicine (0.88)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Biologically-Inspired Spatial Neural Networks

Wołczyk, Maciej, Tabor, Jacek, Śmieja, Marek, Maszke, Szymon

arXiv.org Machine Learning, Oct-7-2019

We introduce bio-inspired artificial neural networks consisting of neurons that are additionally characterized by spatial positions. To simulate properties of biological systems, we add costs penalizing long connections and the proximity of neurons in a two-dimensional space. Our experiments show that when the network performs two different tasks, the neurons naturally split into clusters, where each cluster is responsible for processing a different task. This behavior not only corresponds to biological systems, but also allows for further insight into interpretability and continual learning.

artificial intelligence, neural network, neuron, (17 more...)

arXiv.org Machine Learning

1910.02776

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Geometric Graph Convolutional Neural Networks

Spurek, Przemysław, Danel, Tomasz, Tabor, Jacek, Śmieja, Marek, Struski, Łukasz, Słowik, Agnieszka, Maziarka, Łukasz

arXiv.org Machine Learning, Sep-11-2019

Graph Convolutional Networks (GCNs) have recently become the primary choice for learning from graph-structured data, superseding hash fingerprints in representing chemical compounds. However, GCNs lack the ability to take into account the ordering of node neighbors, even when there is a geometric interpretation of the graph vertices that provides an order based on their spatial positions. To remedy this issue, we propose the Geometric Graph Convolutional Network (geo-GCN), which uses spatial features to efficiently learn from graphs that can be naturally located in space. Our contribution is threefold: we propose a GCN-inspired architecture which (i) leverages node positions, (ii) is a proper generalisation of both GCNs and Convolutional Neural Networks (CNNs), and (iii) benefits from augmentation, which further improves the performance and assures invariance with respect to the desired properties. Empirically, geo-GCN outperforms state-of-the-art graph-based methods on image classification and chemical tasks.

Introduction: Convolutional Neural Networks (CNNs) outperform humans on visual learning tasks, such as image classification (Krizhevsky, Sutskever, and Hinton 2012), object detection (Seferbekov et al. 2018) or image captioning (Yang et al. 2017). They have also been successfully applied to text processing (Kim 2014) and time series analysis (Yang et al. 2015). Nevertheless, CNNs cannot be easily adapted to irregular entities, such as graphs, where data representation is not organised in a grid-like structure. Graph Convolutional Networks (GCNs) attempt to mimic CNNs by operating on spatially close neighbors. Motivated by spectral graph theory, Kipf and Welling (2016) use fixed weights determined by the adjacency matrix of a graph to aggregate labels of the neighbors.
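The fixed, adjacency-determined weights attributed here to Kipf and Welling are commonly written as the symmetrically normalized adjacency matrix with self-loops. The sketch below shows only that aggregation step, leaving out the learned weight matrix and the nonlinearity, and it does not show geo-GCN itself, whose use of node positions is not specified in enough detail in this text. The function names and the toy graph are hypothetical.

```python
import math


def normalized_adjacency(adj):
    """Return D^(-1/2) (A + I) D^(-1/2), where D holds the degrees of A + I."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def aggregate(adj, features):
    """Mix each node's features with those of its neighbors using the fixed weights."""
    norm = normalized_adjacency(adj)
    n, d = len(features), len(features[0])
    return [[sum(norm[i][j] * features[j][k] for j in range(n)) for k in range(d)]
            for i in range(n)]


# Path graph 0 - 1 - 2, with a one-dimensional feature set only on node 0.
adj = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
features = [[1.0], [0.0], [0.0]]
out = aggregate(adj, features)
```

The weights depend only on node degrees, so two neighbors of the same degree contribute identically regardless of where they sit in space, which is the limitation the abstract sets out to address.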

convolution, deep learning, neural network, (19 more...)

arXiv.org Machine Learning

1909.0531

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

SeGMA: Semi-Supervised Gaussian Mixture Auto-Encoder

Śmieja, Marek, Wołczyk, Maciej, Tabor, Jacek, Geiger, Bernhard C.

arXiv.org Artificial Intelligence, Jun-21-2019

We propose a semi-supervised generative model, SeGMA, which learns a joint probability distribution of data and their classes and which is implemented in a typical Wasserstein auto-encoder framework. We choose a mixture of Gaussians as a target distribution in latent space, which provides a natural splitting of data into clusters. To connect Gaussian components with correct classes, we use a small amount of labeled data and a Gaussian classifier induced by the target distribution. SeGMA is optimized efficiently due to the use of the Cramer-Wold distance as a maximum mean discrepancy penalty, which yields a closed-form expression for a mixture of spherical Gaussian components and thus obviates the need for sampling. While SeGMA preserves all properties of its semi-supervised predecessors and achieves at least as good generative performance on standard benchmark data sets, it presents additional features: (a) interpolation between any pair of points in the latent space produces realistic-looking samples; (b) combining the interpolation property with disentangled class and style variables, SeGMA is able to perform a continuous style transfer from one class to another; (c) it is possible to change the intensity of class characteristics in a data point by moving the latent representation of the data point away from specific Gaussian components.
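A Gaussian classifier induced by a mixture can be understood as the posterior probability of each component given a latent point. The sketch below assumes spherical components that share one variance, so the normalizing constants cancel; whether SeGMA uses exactly this parameterization is an assumption, and the function name and the numbers are hypothetical.

```python
import math


def gaussian_classifier(z, means, weights, sigma2):
    """Posterior over mixture components for latent point z (shared spherical variance)."""
    logits = [
        math.log(w) - sum((a - b) ** 2 for a, b in zip(z, mu)) / (2.0 * sigma2)
        for mu, w in zip(means, weights)
    ]
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Two classes, each tied to one Gaussian component in a 2-D latent space.
means = [[-2.0, 0.0], [2.0, 0.0]]
weights = [0.5, 0.5]
on_boundary = gaussian_classifier([0.0, 0.0], means, weights, sigma2=1.0)
near_second = gaussian_classifier([2.0, 0.0], means, weights, sigma2=1.0)
```

A point halfway between the two means gets equal class probabilities, while a point at the second mean is assigned to the second class with high confidence.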

deep learning, latent space, neural network, (20 more...)

arXiv.org Artificial Intelligence

1906.09333

Country:

Europe > Poland (0.14)
Europe > Austria (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
