RLPrompt: Optimizing discrete text prompts with reinforcement learning