Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences