One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration