Causal Representation Learning from Multimodal Biological Observations