Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes