SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Open in new window