Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content