Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance