A generalised OMP algorithm for feature selection with application to gene expression data