Sub-Task Decomposition Enables Learning in Sequence to Sequence Tasks