Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning