Supplementary Material for Text Promptable Surgical Instrument Segmentation with Vision-Language Models Zijian Zhou