Generalizable Coarse-to-Fine Robot Manipulation via Language-Aligned 3D Keypoints