Tractable Reinforcement Learning of Signal Temporal Logic Objectives