Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models