[D] Same dropout probability for every dropout layer? • r/MachineLearning
Would you set the dropout probability for every layer to the same value? I've seen this in some papers and I assume it is because this mininizes the hyperparameters which need to be optimized. What do you think about this?
Jun-5-2018, 13:26:57 GMT
- Technology: