Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist