Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?