ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis