VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting