Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches