From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation