Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

Open in new window