Heterogeneous 360 Degree Videos in Metaverse: Differentiated Reinforcement Learning Approaches