Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies