Embodied Question Answering in Photorealistic Environments with Point Cloud Perception