regularization method
Country:
- Asia > South Korea > Seoul > Seoul (0.05)
- North America > Canada (0.04)
- Asia > Middle East > Jordan (0.04)
- Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Genre:
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.93)
Technology:
- Information Technology > Artificial Intelligence > Natural Language (0.93)
- Information Technology > Data Science (0.92)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.41)
Well-tunedSimpleNetsExcelon TabularDatasets
Weempirically assess theimpact oftheseregularization cocktailsforMLPs ina large-scale empirical study comprising 40 tabular datasets and demonstrate that (i) well-regularized plain MLPs significantly outperform recent state-of-the-art specialized neural network architectures, and (ii) they even outperform strong traditionalMLmethods,suchasXGBoost.
Country:
- Europe > Germany > Baden-Württemberg > Freiburg (0.04)
- North America > United States (0.04)
Technology:
Technology:
Country:
- Asia > China > Beijing > Beijing (0.05)
- North America > United States > Michigan (0.04)
Technology:
Country:
- North America > United States (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)
Country:
- Asia > China > Jiangsu Province (0.04)
- Asia > China > Chongqing Province > Chongqing (0.04)
R-Drop: RegularizedDropoutforNeuralNetworks
In this paper,we introduce asimple yet more effectivealternativeto regularize the training inconsistencyinduced bydropout, named asR-Drop. Concretely,ineachmini-batch training, eachdata sample goes through the forward pass twice, and each pass isprocessed by adifferent sub model by randomly dropping out some hidden units.
Technology:
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)