schmidhuber
Country:
- North America > Canada > Quebec > Montreal (0.14)
- Oceania > Australia (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Asia > Middle East > Jordan (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)
Country:
- North America > United States > Utah (0.04)
- Asia > Taiwan > Taiwan Province > Taipei (0.04)
- Asia > Middle East > Jordan (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)
Country:
- Europe > Switzerland (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Europe > Finland > Uusimaa > Helsinki (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Country:
- North America > United States (0.14)
- Asia > China > Guangdong Province > Shenzhen (0.04)
Technology:
Country:
- North America > United States (0.04)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Technology:
Country:
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- North America > United States > District of Columbia > Washington (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- (3 more...)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Country:
- North America > Canada > Quebec > Montreal (0.05)
- North America > United States > Colorado > Boulder County > Boulder (0.04)
Technology:
Country:
- North America > Canada > Quebec > Montreal (0.04)
- Asia > Middle East > Jordan (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Breaking the Activation Function Bottleneck through Adaptive Parameterization
Sebastian Flennerhag, Hujun Yin, John Keane, Mark Elliot
Adaptive parameterization is a means of increasing this flexibility and thereby increasing the model's capacity to learn non-linear patterns. We focus on the feed-forward layer, f(x):= φ(W x+b),for some activation functionφ: R 7 R. Define the pre-activation layer as a = A(x):= Wx+band denote byg(a):= φ(a)/athe activation effect ofφgivena, where divisioniselement-wise.
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)
Country:
- Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)
- Oceania > Australia > Queensland (0.04)
- Oceania > Australia > New South Wales > Sydney (0.04)
- (7 more...)
Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
- Information Technology > Artificial Intelligence > Cognitive Science (0.93)
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)