In-Hand Object Pose Estimation via Visual-Tactile Fusion