Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures