An example Markov Decision Process model on setting rewards in a text sentence?