Fine-Tuning Language Models with Just Forward Passes

Sadhika Malladi, Tianyu Gao

Neural Information Processing Systems 

In this work, we propose a memory-efficient zeroth-order optimizer (MeZO), adapting the classical ZO-SGD method to operate in-place, thereby fine-tuning LMs with the same memory footprint as inference.
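The in-place idea can be illustrated with a minimal sketch: estimate the gradient via two forward passes with a shared random perturbation (SPSA-style), and regenerate the perturbation from a stored seed instead of keeping it in memory. The function names and hyperparameters below are illustrative, not the paper's implementation.

```python
import numpy as np

def mezo_step(params, loss_fn, eps=1e-3, lr=1e-2, seed=None):
    """One zeroth-order step: two forward passes, in-place perturb/restore,
    so only a single copy of the parameters is ever held in memory."""
    rng_seed = np.random.randint(2**31) if seed is None else seed

    def perturb(scale):
        # Regenerate the same Gaussian z from the seed rather than storing it.
        rng = np.random.default_rng(rng_seed)
        for p in params:
            p += scale * eps * rng.standard_normal(p.shape)

    perturb(+1.0)
    loss_plus = loss_fn(params)
    perturb(-2.0)
    loss_minus = loss_fn(params)
    perturb(+1.0)  # restore the original parameters

    # Scalar projected-gradient estimate from the two losses.
    proj_grad = (loss_plus - loss_minus) / (2 * eps)

    # In-place SGD update, regenerating z once more from the seed.
    rng = np.random.default_rng(rng_seed)
    for p in params:
        p -= lr * proj_grad * rng.standard_normal(p.shape)
    return loss_plus

# Usage: minimize a simple quadratic using forward passes only.
w = [np.array([3.0, -2.0])]
quad = lambda ps: float(np.sum(ps[0] ** 2))
for _ in range(500):
    mezo_step(w, quad, eps=1e-3, lr=1e-2)
```

The seed trick is what keeps the memory footprint at inference level: the perturbation vector z is never materialized alongside the parameters, only regenerated on demand.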