Discourse-Driven Evaluation: Unveiling Factual Inconsistency in Long Document Summarization