FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification

Mar-21-2025, 18:31:03 GMT–Neural Information Processing Systems

Diffusion Transformers (DiT) have attracted significant attention in research. However, they suffer from a slow convergence rate. In this paper, we aim to accelerate DiT training without any architectural modification. We identify the following issues in the training process: firstly, certain training strategies do not consistently perform well across different data. Secondly, the effectiveness of supervision at specific timesteps is limited. In response, we propose the following contributions: (1) We introduce a new perspective for interpreting the failure of the strategies. Specifically, we slightly extend the definition of Signal-to-Noise Ratio (SNR) and suggest observing the Probability Density Function (PDF) of SNR to understand the essence of the data robustness of the strategy.

arxiv preprint arxiv, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Mar-21-2025, 18:31:03 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.14)
- Europe > Germany (0.14)

Genre:
- Research Report
  - Experimental Study (0.93)
  - New Finding (0.67)

Industry:
- Health & Medicine (0.68)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks (1.00)
    - Natural Language (0.95)
    - Vision (1.00)
  - Sensing and Signal Processing > Image Processing (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found