MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models