PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation