sEHR-CE: Language modelling of structured EHR data for efficient and generalizable patient cohort expansion