Learning Logic Specifications for Soft Policy Guidance in POMCP