CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations
Supplementary Document