Building Flexible, Scalable, and Machine Learning-ready Multimodal Oncology Datasets