MusicRL: Aligning Music Generation to Human Preferences