TLDR: Token-Level Detective Reward Model for Large Vision Language Models