BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification