Domain Knowledge Integration By Gradient Matching For Sample-Efficient Reinforcement Learning