Generative human motion mimicking through feature extraction in denoising diffusion settings