Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning