LIME: Making LLM Data More Efficient with Linguistic Metadata Embeddings