Dyadic Reinforcement Learning