Generalized Neural Policies for Relational MDPs