Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models