Asynchronous Parallel Reinforcement Learning for Optimizing Propulsive Performance in Fin Ray Control