Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training