WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset