Combining learning rate decay and weight decay with complexity gradient descent - Part I

Open in new window