Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities