How important are activation functions in regression and classification? A survey, performance comparison, and future directions