Can You Label Less by Using Out-of-Domain Data? Active & Transfer Learning with Few-shot Instructions