Enhancing Token Filtering Efficiency in Large Language Model Training with Collider