Towards Abstractive Timeline Summarisation using Preference-based Reinforcement Learning