Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment