Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue