Oasis: Data Curation and Assessment System for Pretraining of Large Language Models