Domain Adaptation and Multi-view Attention for Learnable Landmark Tracking with Sparse Data