Decomposed Reasoning with Reinforcement Learning for Relevance Assessment in UGC Platforms