Goto

Collaborating Authors

 resnet101


[ Supplementary Material ] Learning to Adapt via Latent Domains for Adaptive Semantic Segmentation Anonymous Author(s) Affiliation Address email

Neural Information Processing Systems

AAppendix1 In the supplementary material, we provide more experimental results summarized as follows:2 In A.1, we use ResNet101 as the backbone network and compare our method with state-of-3 the-art methods, demonstrating that our method achieves consistent top results on different4 In A.2, we provide more t-SNE visualization results for a comprehensive analysis on the6 feature space learned from different models.7 In A.3, we study the effect of the image-to-image translation model on the performance of8 domain adaptive semantic segmentation.9 In A.4, we discuss the limitations of our method and provide the URL link of code to10 reproduce the main experimental results.11 "V" and "R" indicate the method using VGG16 and ResNet101 backbone networks, respectively. In the main paper, we report results using VGG1613 as the backbone for both settings: single-target14 and multi-target domain adaptation.



SupplementaryMaterialofDeepMultimodalFusion byChannelExchanging

Neural Information Processing Systems

Corollary1statesthatf0 ismoreexpressivethan f when ฮณ = 0, and thus the optimalf0 always outputs no higher loss, which, yet, is not true for arbitraryf0 (e.g.


339a18def9898dd60a634b2ad8fbbd58-Paper.pdf

Neural Information Processing Systems

Such message exchanging isparameter-free andselfadaptive, as it is dynamically controlled by the scaling factors that are determined by the training itself.




Data Heterogeneity and Forgotten Labels in Split Federated Learning

arXiv.org Artificial Intelligence

In Split Federated Learning (SFL), the clients collaboratively train a model with the help of a server by splitting the model into two parts. Part-1 is trained locally at each client and aggregated by the aggregator at the end of each round. Part-2 is trained at a server that sequentially processes the intermediate activations received from each client. We study the phenomenon of catastrophic forgetting (CF) in SFL in the presence of data heterogeneity. In detail, due to the nature of SFL, local updates of part-1 may drift away from global optima, while part-2 is sensitive to the processing sequence, similar to forgetting in continual learning (CL). Specifically, we observe that the trained model performs better in classes (labels) seen at the end of the sequence. We investigate this phenomenon with emphasis on key aspects of SFL, such as the processing order at the server and the cut layer. Based on our findings, we propose Hydra, a novel mitigation method inspired by multi-head neural networks and adapted for the SFL's setting. Extensive numerical evaluations show that Hydra outperforms baselines and methods from the literature.



A Additional Experiments

Neural Information Processing Systems

For more complicated periodic functions, see Figure 11 and 12. For a larger value of a, the extrapolation improves. We use Adam as the optimizer. The region within the dashed vertical lines are the range of the training set. Learning range is indicated by the blue vertical bars.


Response to Reviewer 5

Neural Information Processing Systems

We appreciate suggestions from R6, 7, 8 and will include these in the paper. We have included most competitive methods with comparable settings to ours at the submission time. We will include the shown Algorithm 1. S sampled such that all attributes are present? However, our framework can compose features from any set S by solving Eq (10) even with missing attributes in S . Please notice that they are different.