Fisher-Orthogonal Projection Methods for Natural Gradient Descent with Large Batches
