TB or Not TB: Coverage-Driven Direct Preference Optimization for Verilog Stimulus Generation