Deterministic Inference across Tensor Parallel Sizes That Eliminates Training-Inference Mismatch