Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming