AITopics | Perceptrons

The term neural networks refers to networks of neurons in the mammalian brain. Neurons are the brain's fundamental units of computation. In the brain they are connected together in networks to process data. This can be a very complex task, and the dynamics of neural networks in the mammalian brain in response to external stimuli can therefore be quite intricate. Inputs and outputs of each neuron vary as functions of time, in the form of so-called spike trains, but the network itself also changes. We learn and improve our data-processing capacities by establishing reconnections between neurons. Neural-network algorithms are inspired by the architecture and the dynamics of networks of neurons in the brain. Yet the algorithms use neuron models that are highly simplified compared with real neurons. Nevertheless, the fundamental principle is the same: artificial neural networks learn by reconnection.

algorithm, equation, neuron, (16 more...)

arXiv.org Machine Learning

1901.05639

Country:

Europe > Sweden > Vaestra Goetaland > Gothenburg (0.13)
North America > Canada > Ontario > Toronto (0.13)
North America > United States > California > Orange County > Irvine (0.04)
(5 more...)

Genre:

Research Report (1.00)
Summary/Review (0.67)
Instructional Material > Course Syllabus & Notes (0.45)

Industry:

Leisure & Entertainment (1.00)
Information Technology (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.69)

Activation Functions for Generalized Learning Vector Quantization - A Performance Comparison

Villmann, Thomas, Ravichandran, John, Villmann, Andrea, Nebel, David, Kaden, Marika

arXiv.org Machine Learning, Jan-17-2019

An appropriate choice of the activation function (like ReLU, sigmoid or swish) plays an important role in the performance of (deep) multilayer perceptrons (MLP) for classification and regression learning. Prototype-based classification learning methods like (generalized) learning vector quantization (GLVQ) are powerful alternatives. These models also deal with activation functions but here they are applied to the so-called classifier function instead. In this paper we investigate successful candidates of activation functions known for MLPs for application in GLVQ and their influence on the performance.
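To make the idea concrete, here is a minimal sketch of the three activation functions named above applied to a GLVQ classifier function. The excerpt does not define the classifier function, so this assumes the standard GLVQ form, mu(x) = (d_plus - d_minus) / (d_plus + d_minus), with squared Euclidean distances. The prototypes, samples, and helper names are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def relu(z):
    return max(0.0, z)

def swish(z, beta=1.0):
    # swish(z) = z * sigmoid(beta * z)
    return z * sigmoid(beta * z)

def squared_distance(x, w):
    return sum((xi - wi) ** 2 for xi, wi in zip(x, w))

def glvq_classifier(x, label, prototypes):
    """Assumed standard GLVQ classifier function, with values in [-1, 1].

    d_plus is the distance to the closest prototype of the correct class,
    d_minus the distance to the closest prototype of any other class.
    A negative value means the sample is classified correctly.
    """
    d_plus = min(squared_distance(x, w) for w, c in prototypes if c == label)
    d_minus = min(squared_distance(x, w) for w, c in prototypes if c != label)
    return (d_plus - d_minus) / (d_plus + d_minus)

def glvq_cost(samples, prototypes, activation):
    # The activation is applied to the classifier function, not to a neuron.
    return sum(activation(glvq_classifier(x, y, prototypes)) for x, y in samples)

# Hypothetical toy data: one prototype per class in 2-D.
prototypes = [((0.0, 0.0), 0), ((1.0, 1.0), 1)]
samples = [((0.1, 0.2), 0), ((0.9, 0.8), 1), ((0.6, 0.6), 0)]

for name, f in [("sigmoid", sigmoid), ("relu", relu), ("swish", swish)]:
    print(name, round(glvq_cost(samples, prototypes, f), 4))
```

With ReLU, only the misclassified third sample contributes to the cost, whereas sigmoid and swish also weight correctly classified samples near the decision border.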

activation function, glvq, learning vector quantization, (11 more...)

arXiv.org Machine Learning

1901.05995

Country:

Europe > Germany (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
South America > Paraguay > Asunción > Asunción (0.04)
(7 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.57)

Applying SVGD to Bayesian Neural Networks for Cyclical Time-Series Prediction and Inference

Hu, Xinyu, Szerlip, Paul, Karaletsos, Theofanis, Singh, Rohit

arXiv.org Machine Learning, Jan-17-2019

A regression-based Bayesian neural network (BNN) model is proposed to predict spatiotemporal quantities like hourly rider demand with calibrated uncertainties. The main contributions of this paper are (i) a feed-forward deterministic neural network (DetNN) architecture that predicts cyclical time series data with sensitivity to anomalous forecasting events; (ii) a Bayesian framework applying Stein variational gradient descent (SVGD) to train large neural networks for such tasks, capable of producing time series predictions as well as measures of uncertainty surrounding the predictions. Experiments show that the proposed BNN reduces average estimation error by 10% across 8 U.S. cities compared to a fine-tuned multilayer perceptron (MLP), and by 4% compared to the same network architecture trained without SVGD.
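For readers unfamiliar with SVGD, the following is a minimal one-dimensional sketch of the particle update, not the paper's BNN training code. It assumes an RBF kernel with a fixed bandwidth and a toy Gaussian target. All names, the step size, and the target are illustrative assumptions. In the paper's setting each particle would be a full set of network weights.

```python
import math

def rbf_kernel(a, b, bandwidth):
    return math.exp(-((a - b) ** 2) / bandwidth)

def svgd_step(particles, grad_log_p, step_size=0.1, bandwidth=1.0):
    """One SVGD update for scalar particles.

    Each particle moves along the average of two terms over all particles:
    a kernel-weighted gradient of the log density (attraction to high
    probability) and the kernel gradient (repulsion that keeps particles apart).
    """
    n = len(particles)
    updated = []
    for x_i in particles:
        phi = 0.0
        for x_j in particles:
            k = rbf_kernel(x_j, x_i, bandwidth)
            attraction = k * grad_log_p(x_j)
            # Derivative of k(x_j, x_i) with respect to x_j.
            repulsion = -2.0 * (x_j - x_i) / bandwidth * k
            phi += attraction + repulsion
        updated.append(x_i + step_size * phi / n)
    return updated

def grad_log_gaussian(x, mean=2.0, std=1.0):
    # Gradient of log N(x; mean, std^2); toy target assumed for illustration.
    return -(x - mean) / std ** 2

particles = [-1.0, -0.5, 0.0, 0.5, 1.0]
for _ in range(1000):
    particles = svgd_step(particles, grad_log_gaussian)

mean = sum(particles) / len(particles)
spread = max(particles) - min(particles)
print("particle mean:", round(mean, 3), "spread:", round(spread, 3))
```

The particles drift toward the target mean while the repulsion term stops them from collapsing to a single point, which is what lets the particle set carry an uncertainty estimate.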

neural network, prediction variability, svgd, (9 more...)

arXiv.org Machine Learning

1901.05906

Country:

North America > United States > California > San Francisco County > San Francisco (0.15)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(3 more...)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.54)

Weightless Neural Network with Transfer Learning to Detect Distress in Asphalt

Milhomem, Suayder, Almeida, Tiago da Silva, da Silva, Warley Gramacho, da Silva, Edeilson Milhomem, de Carvalho, Rafael Lima

arXiv.org Machine Learning, Jan-3-2019

The present paper shows a solution to the problem of automatic distress detection, more precisely the detection of holes in paved roads. To do so, the proposed solution uses a weightless neural network known as Wisard to decide whether an image of a road has any kind of cracks. In addition, the proposed architecture also shows how the use of transfer learning was able to improve the overall accuracy of the decision system. As a verification step of the research, an experiment was carried out using images from the streets at the Federal University of Tocantins, Brazil. The architecture of the developed solution achieves 85.71% accuracy on the dataset, proving to be superior to state-of-the-art approaches. I. INTRODUCTION. In Brazil, most of the traffic is driven on asphalt roads.

deep learning, neural network, transfer learning, (20 more...)

arXiv.org Machine Learning

doi: 10.22161/ijaers.5.12.40

1901.0366

Country: South America > Brazil > Tocantins (0.25)

Genre: Research Report (0.64)

Industry:

Energy > Oil & Gas (0.62)
Construction & Engineering (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.30)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Why cannot one find the zero in the delta rule for sigmoid? (No closed form to find weights in one-layer perceptron neural network?)

#artificialintelligence, Jan-2-2019, 03:12:53 GMT

I know that finding the weights of a neural network requires gradient descent as there is no closed form available. I know this from the books, and not knowing exactly why the derivative w.r.t. the weights cannot simply be set to zero led me to try to do it. Let's consider the traditional sigmoid MLP, with just one layer and just one datapoint $\mathbf{x}, t$. Take the gradient vector of the MSE loss function w.r.t. the weights. Now, how does one solve for the zero of that gradient expression? What I could do is analyze the various factors and see where each of them individually vanishes.
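As an illustration of the factor-by-factor analysis, here is a small sketch that assumes a single sigmoid output unit y = sigmoid(w · x) and the loss E = (y - t)^2 / 2. The poster's own equation is not shown in this excerpt, so this form is an assumption. Under it the gradient is (y - t) · y(1 - y) · x, a product of an error factor, a slope factor, and the input.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def mse_gradient(w, x, t):
    """Gradient of E = 0.5 * (y - t)**2 for y = sigmoid(w . x).

    Returns the gradient together with its two scalar factors so that
    each can be inspected on its own.
    """
    y = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
    error = y - t          # zero only when the output hits the target
    slope = y * (1.0 - y)  # tends to zero only as the unit saturates
    return [error * slope * xi for xi in x], error, slope

x, t = [1.0, 2.0], 0.8  # hypothetical single datapoint

# Plain gradient descent drives the error factor to zero.
w = [0.0, 0.0]
for _ in range(5000):
    grad, _, _ = mse_gradient(w, x, t)
    w = [wi - 0.5 * gi for wi, gi in zip(w, grad)]

pre_activation = sum(wi * xi for wi, xi in zip(w, x))
target_logit = math.log(t / (1.0 - t))
print("w . x =", round(pre_activation, 6), "logit(t) =", round(target_logit, 6))

# A saturated unit has a vanishing gradient even though the error is large.
saturated_grad, saturated_error, saturated_slope = mse_gradient([50.0, 50.0], x, t)
print("saturated:", saturated_grad, "error:", round(saturated_error, 3))
```

For one datapoint with 0 < t < 1, the error factor vanishes on the whole hyperplane w · x = ln(t / (1 - t)), so a closed form does exist in this special case. With several datapoints the gradient becomes a sum of such products, and the factors can no longer be zeroed one at a time.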

artificial intelligence, machine learning, mathbf, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.40)

Recurrent Relational Networks

Palm, Rasmus, Paquet, Ulrich, Winther, Ole

Neural Information Processing Systems, Dec-31-2018

This paper is concerned with learning to solve tasks that require a chain of interdependent steps of relational inference, like answering complex questions about the relationships between objects, or solving puzzles where the smaller elements of a solution mutually constrain each other. We introduce the recurrent relational network, a general purpose module that operates on a graph representation of objects. As a generalization of Santoro et al. [2017]’s relational network, it can augment any neural network model with the capacity to do many-step relational reasoning. We achieve state-of-the-art results on the bAbI textual question-answering dataset with the recurrent relational network, consistently solving 20/20 tasks. As bAbI is not particularly challenging from a relational reasoning point of view, we introduce Pretty-CLEVR, a new diagnostic dataset for relational reasoning. In the Pretty-CLEVR set-up, we can vary the question to control for the number of relational reasoning steps that are required to obtain the answer. Using Pretty-CLEVR, we probe the limitations of multi-layer perceptrons, relational and recurrent relational networks. Finally, we show how recurrent relational networks can learn to solve Sudoku puzzles from supervised training data, a challenging task requiring upwards of 64 steps of relational reasoning. We achieve state-of-the-art results amongst comparable methods by solving 96.6% of the hardest Sudoku puzzles.

artificial intelligence, deep learning, machine learning, (15 more...)

Neural Information Processing Systems

Genre: Workflow (0.69)

Industry: Leisure & Entertainment > Games > Sudoku (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.68)

Throwing everything - including the kitchen sink - at a machine learning problem

#artificialintelligence, Dec-28-2018, 23:58:09 GMT

It seems the more I read, the more confused I get - models, algorithms, surrogates; my head is spinning. Assume the dataset is in perfect condition - pure as the driven snow, no correlated features, no null in sight, nothing; and it has "enough" observations. To simplify, let's say we are looking at binary classification. Let's also say that we want to try four different algorithms: for example - logistic regression, naive Bayes, gradient boosted tree and multilayer perceptron. And, finally, let's assume that (since all this is for educational purposes), we have no issues with time, efficiency, computing power, computing budget and whatnot; we don't care if this is an overkill or if we're going after a fly with an elephant gun: we want to throw everything, including the kitchen sink, at the problem so we can extract every last ounce of performance when it's time to make predictions on totally unseen data.

artificial intelligence, kitchen sink, machine learning, (2 more...)

#artificialintelligence

Industry: Education > Focused Education > Special Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.42)

Dropout Regularization in Hierarchical Mixture of Experts

İrsoy, Ozan, Alpaydın, Ethem

arXiv.org Machine Learning, Dec-25-2018

Dropout is a very effective method for preventing overfitting and has become the go-to regularizer for multi-layer neural networks in recent years. Hierarchical mixture of experts is a hierarchically gated model that defines a soft decision tree where leaves correspond to experts and decision nodes correspond to gating models that softly choose between their children, and as such, the model defines a soft hierarchical partitioning of the input space. In this work, we propose a variant of dropout for hierarchical mixture of experts that is faithful to the tree hierarchy defined by the model, as opposed to having a flat, unitwise independent application of dropout as one has with multi-layer perceptrons. We show that on synthetic regression data and on the MNIST and CIFAR-10 datasets, our proposed dropout mechanism prevents overfitting on trees with many levels, improving generalization and providing smoother fits.
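For contrast, the flat, unit-wise dropout that the abstract attributes to multi-layer perceptrons can be sketched as below. This is the common "inverted dropout" formulation, assumed here for illustration. It is not the tree-faithful variant the paper proposes, and the function name and values are hypothetical.

```python
import random

def dropout(activations, rate, training=True, rng=None):
    """Flat, unit-wise inverted dropout over a list of activations.

    Each unit is dropped independently with probability `rate`. Kept units
    are scaled by 1 / (1 - rate) so the expected activation is unchanged,
    which lets inference use the activations as they are.
    """
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    if not training or rate == 0.0:
        return list(activations)
    rng = rng or random.Random()
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]

hidden = [0.5, 1.0, -2.0, 3.0]  # hypothetical hidden-layer activations
dropped = dropout(hidden, rate=0.5, rng=random.Random(0))
print("training: ", dropped)
print("inference:", dropout(hidden, rate=0.5, training=False))
```

Every unit is treated independently of every other here. A tree-structured model has no such flat layer of interchangeable units, which is the mismatch the abstract points to.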

dropout, dropout rate, neural network, (15 more...)

arXiv.org Machine Learning

1812.10158

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > Middle East > Jordan (0.05)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report (0.52)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Deep Autoencoder for Recommender Systems: Parameter Influence Analysis

Tran, Dai Hoang, Hussain, Zawar, Zhang, Wei Emma, Khoa, Nguyen Lu Dang, Tran, Nguyen H., Sheng, Quan Z.

arXiv.org Machine Learning, Dec-24-2018

Recommender systems have recently attracted many researchers in the deep learning community. The state-of-the-art deep neural network models used in recommender systems are typically the multilayer perceptron and the deep Autoencoder (DAE), among which DAE usually shows better performance due to its superior capability to reconstruct the inputs. However, we found that existing DAE recommendation systems with similar implementations on similar datasets end up with vastly different parameter settings. In this work, we have built a flexible DAE model, named FlexEncoder, that uses configurable parameters and unique features to analyse the influence of parameters on the prediction accuracy of recommender systems. This will help us identify the best-performing parameters for a given dataset. Extensive evaluations on the MovieLens datasets are conducted, which drive our conclusions on the influences of DAE parameters. Specifically, we find that DAE parameters strongly affect the prediction accuracy of recommender systems, and that the effect is transferable to similar datasets of a larger size. We open our code to the public, which could benefit both new users of DAE, who can quickly understand how DAE works for recommendation systems, and experienced DAE users, for whom it becomes easier to tune the parameters on different datasets.

autoencoder, prediction, recommender system, (14 more...)

arXiv.org Machine Learning

1901.00415

Country: Oceania > Australia > New South Wales > Sydney (0.05)

Genre: Research Report > New Finding (0.48)

Industry: