Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Open in new window