Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes