Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference