T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Neural Information Processing Systems 

Recent advances in large language models have shown that chain-of-thought (CoT) reasoning and reinforcement learning (RL) can improve performance. However, applying these reasoning strategies to visual generation remains largely unexplored.