Neural Proxies for Sound Synthesizers: Learning Perceptually Informed Preset Representations