Decomposed Reasoning with Reinforcement Learning for Relevance Assessment in UGC Platforms

Open in new window