Online Learning and Planning in Partially Observable Domains without Prior Knowledge