Beyond Model Ranking: Predictability-Aligned Evaluation for Time Series Forecasting