MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing