Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task