sub-challenge
MultiEarth 2023 -- Multimodal Learning for Earth and Environment Workshop and Challenge
Cha, Miriam, Angelides, Gregory, Hamilton, Mark, Soszynski, Andy, Swenson, Brandon, Maidel, Nathaniel, Isola, Phillip, Perron, Taylor, Freeman, Bill
The Multimodal Learning for Earth and Environment Workshop (MultiEarth 2023) is the second annual CVPR workshop aimed at the monitoring and analysis of the health of Earth ecosystems by leveraging the vast amount of remote sensing data that is continuously being collected. The primary objective of this workshop is to bring together the Earth and environmental science communities as well as the multimodal representation learning communities to explore new ways of harnessing technological advancements in support of environmental monitoring. The MultiEarth Workshop also seeks to provide a common benchmark for processing multimodal remote sensing information by organizing public challenges focused on monitoring the Amazon rainforest. These challenges include estimating deforestation, detecting forest fires, translating synthetic aperture radar (SAR) images to the visible domain, and projecting environmental trends. This paper presents the challenge guidelines, datasets, and evaluation metrics. Our challenge website is available at https://sites.google.com/view/rainforest-challenge/multiearth-2023.
The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests
Schuller, Björn W., Batliner, Anton, Amiriparian, Shahin, Barnhill, Alexander, Gerczuk, Maurice, Triantafyllopoulos, Andreas, Baird, Alice, Tzirakis, Panagiotis, Gagne, Chris, Cowen, Alan S., Lackovic, Nikola, Caraty, Marie-José, Montacié, Claude
The ACM Multimedia 2023 Computational Paralinguistics Challenge addresses two different problems for the first time in a research competition under well-defined conditions: In the Emotion Share Sub-Challenge, a regression on speech has to be made; and in the Requests Sub-Challenges, requests and complaints need to be detected. We describe the Sub-Challenges, baseline feature extraction, and classifiers based on the usual ComPaRE features, the auDeep toolkit, and deep feature extraction from pre-trained CNNs using the DeepSpectRum toolkit; in addition, wav2vec2 models are used.
Hybrid Multimodal Feature Extraction, Mining and Fusion for Sentiment Analysis
Li, Jia, Zhang, Ziyang, Lang, Junjie, Jiang, Yueqi, An, Liuwei, Zou, Peng, Xu, Yangyang, Gao, Sheng, Lin, Jie, Fan, Chunxiao, Sun, Xiao, Wang, Meng
In this paper, we present our solutions for the Multimodal Sentiment Analysis Challenge (MuSe) 2022, which includes MuSe-Humor, MuSe-Reaction and MuSe-Stress Sub-challenges. The MuSe 2022 focuses on humor detection, emotional reactions and multimodal emotional stress utilizing different modalities and data sets. In our work, different kinds of multimodal features are extracted, including acoustic, visual, text and biological features. These features are fused by TEMMA and GRU with self-attention mechanism frameworks. In this paper, 1) several new audio features, facial expression features and paragraph-level text embeddings are extracted for accuracy improvement. 2) we substantially improve the accuracy and reliability of multimodal sentiment prediction by mining and blending the multimodal features. 3) effective data augmentation strategies are applied in model training to alleviate the problem of sample imbalance and prevent the model from learning biased subject characters. For the MuSe-Humor sub-challenge, our model obtains the AUC score of 0.8932. For the MuSe-Reaction sub-challenge, the Pearson's Correlations Coefficient of our approach on the test set is 0.3879, which outperforms all other participants. For the MuSe-Stress sub-challenge, our approach outperforms the baseline in both arousal and valence on the test dataset, reaching a final combined result of 0.5151.
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition
Ringeval, Fabien, Schuller, Björn, Valstar, Michel, Cummins, NIcholas, Cowie, Roddy, Tavabi, Leili, Schmitt, Maximilian, Alisamir, Sina, Amiriparian, Shahin, Messner, Eva-Maria, Song, Siyang, Liu, Shuo, Zhao, Ziping, Mallol-Ragolta, Adria, Ren, Zhao, Soleymani, Mohammad, Pantic, Maja
The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mind, Detecting Depression with AI, and Cross-cultural Affect Recognition" is the ninth competition event aimed at the comparison of multimedia processing and machine learning methods for automatic audiovisual health and emotion analysis, with all participants competing strictly under the same conditions. The goal of the Challenge is to provide a common benchmark test set for multimodal information processing and to bring together the health and emotion recognition communities, as well as the audiovisual processing communities, to compare the relative merits of various approaches to health and emotion recognition from real-life data. This paper presents the major novelties introduced this year, the challenge guidelines, the data used, and the performance of the baseline systems on the three proposed tasks: state-of-mind recognition, depression assessment with AI, and cross-cultural affect sensing, respectively.