Improving Romanian LLM Pretraining Data using Diversity and Quality Filtering