A novel approach to model exploration for value function learning