ViPER: Empowering the Self-Evolution of Visual Perception Abilities in Vision-Language Model