Fine-Tuning Language Models with Just Forward Passes
Sadhika Malladi, Tianyu Gao
Neural Information Processing Systems
In this work, we propose a memory-efficient zeroth-order optimizer (MeZO), adapting the classical ZO-SGD method to operate in-place, thereby fine-tuning LMs with the same memory footprint as inference.
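To make the idea of an in-place zeroth-order step concrete, here is a minimal NumPy sketch. It assumes a two-point (SPSA-style) gradient estimate and regenerates the random perturbation from a stored seed, so no copy of the parameters or of the perturbation is kept. The function names, hyperparameters, and the toy quadratic loss are illustrative assumptions, not taken from the paper's code.

```python
import numpy as np


def perturb_in_place(params, seed, scale):
    """Add scale * z to every array in params, in place.

    z ~ N(0, I) is regenerated from `seed` on each call, so it never
    has to be stored alongside the parameters.
    """
    rng = np.random.default_rng(seed)
    for p in params:
        p += scale * rng.standard_normal(p.shape)


def zo_sgd_step(params, loss_fn, seed, lr=1e-2, eps=1e-3):
    """One zeroth-order SGD step using only two forward passes."""
    perturb_in_place(params, seed, +eps)       # theta + eps * z
    loss_plus = loss_fn(params)
    perturb_in_place(params, seed, -2 * eps)   # theta - eps * z
    loss_minus = loss_fn(params)
    perturb_in_place(params, seed, +eps)       # back to theta (up to rounding)

    # Scalar estimate of the directional derivative along z.
    projected_grad = (loss_plus - loss_minus) / (2 * eps)

    # theta <- theta - lr * projected_grad * z, with z regenerated from the seed.
    perturb_in_place(params, seed, -lr * projected_grad)
    return projected_grad


def quadratic_loss(params):
    """Toy stand-in for a model's forward pass (assumption for this sketch)."""
    return float(sum(np.sum(p ** 2) for p in params))


params = [np.ones(4), np.ones((2, 3))]
initial_loss = quadratic_loss(params)
for step in range(200):
    zo_sgd_step(params, quadratic_loss, seed=step)
final_loss = quadratic_loss(params)
```

The only extra state per step is one seed and one scalar, which is why this style of update can stay close to the memory footprint of inference. In a real fine-tuning setup `loss_fn` would be a forward pass of the language model on a minibatch.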
Oct-9-2025, 03:51:02 GMT
- Country:
  - Asia > Middle East > Jordan (0.04)
  - North America > United States (0.27)
- Genre:
  - Research Report > New Finding (1.00)
- Technology: