Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions
Rane, Chinmay, Tyagi, Kanishka, Manry, Michael
–arXiv.org Artificial Intelligence
Deep learning training algorithms have seen great success in recent years in many fields, including speech, text, image, and video. Increasingly deep architectures have been proposed with great success, with ResNet structures reaching around 152 layers. Shallow convolutional neural networks (CNNs) remain an active research area, where some phenomena are still unexplained. The activation functions used in a network are of utmost importance, as they provide its non-linearity; ReLU is the most commonly used activation function. We introduce a more expressive piece-wise linear (PWL) activation in the hidden layer and show that these PWL activations work much better than ReLU activations in our networks, for both convolutional neural networks and multilayer perceptrons. Result comparisons in PyTorch for shallow and deep CNNs are given to further strengthen our case.
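One common way to build a piece-wise linear activation is as a sum of shifted ReLU "hinges", which generalizes a single ReLU to an arbitrary PWL curve. The abstract does not give the paper's exact parameterization, so the hinge locations and slopes below are purely illustrative assumptions; a minimal stdlib-only sketch:

```python
def relu(x):
    """Standard ReLU: max(0, x)."""
    return max(0.0, x)

def pwl(x, hinges=(-1.0, 0.0, 1.0), slopes=(0.5, 1.0, 0.5)):
    """Illustrative piece-wise linear activation.

    Each (hinge, slope) pair contributes slope * relu(x - hinge),
    so the result is linear between hinges and changes slope at
    each hinge. With one hinge at 0 and slope 1 it reduces to ReLU.
    The specific hinge/slope values here are hypothetical, not the
    paper's learned parameters.
    """
    return sum(a * relu(x - b) for b, a in zip(hinges, slopes))
```

In a trainable setting (e.g. PyTorch), the hinge locations and slopes would be learnable parameters of the layer, letting each unit adapt the shape of its activation during training.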
Aug-10-2023