Scaling Open-Vocabulary Object Detection