Low-Fidelity Video Encoder Optimization for Temporal Action Localization