Memory Analysis on the Training Course of DeepSeek Models