Learning in Non-Cooperative Configurable Markov Decision Processes

Alberto Maria Metelli

ETH AI Center, Zurich, Switzerland
Politecnico di Milano