VideoOrion: Tokenizing Object Dynamics in Videos