Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving