Inefficiency of K-FAC for Large Batch Size Training
