Disentangled Interleaving Variational Encoding

Wong, Noelle Y. L., Cheu, Eng Yeow, Chiam, Zhonglin, Srinivasan, Dipti

arXiv.org Machine Learning 

Conflicting objectives present a considerable challenge in interleaving multi-task learning, necessitating careful design and balancing to ensure effective learning of a representative latent data space across all tasks without mutual negative impact. Our proposed model, Deep Disentangled Interleaving Variational Encoding (DeepDIVE), learns disentangled features from the original input to form clusters in the embedding space and unifies these features via a cross-attention mechanism in the fusion stage. We theoretically prove that combining the objectives for reconstruction and forecasting fully captures the lower bound, and we mathematically derive a loss function for disentanglement using Naïve Bayes. Experiments on two public datasets show that DeepDIVE disentangles the original input and yields forecast accuracies better than the original VAE and comparable to existing state-of-the-art baselines.

In multi-objective deep learning, gradients from different objectives can conflict when the loss terms induce competing gradient directions during training of the network. Balancing these gradients to ensure stable and effective learning is a significant challenge, prompting the development of mitigation methods such as Liu et al. (2021), Yu et al. (2020), and Sener & Koltun (2018), which solve an additional optimization problem before each gradient update step to manipulate conflicting gradients before the update.
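To make the gradient-manipulation idea concrete, below is a minimal NumPy sketch in the spirit of the gradient surgery of Yu et al. (2020) (PCGrad): when two flattened task gradients conflict (negative inner product), each is projected onto the normal plane of the other before they are summed. The function name and two-task simplification are illustrative, not the interface of any of the cited methods.

```python
import numpy as np

def project_conflicting(g1, g2):
    """PCGrad-style combination of two task gradients (illustrative).

    If g1 and g2 conflict (negative dot product), project each onto the
    normal plane of the other, removing the conflicting component; then
    return the sum as the combined update direction.
    """
    g1 = np.asarray(g1, dtype=float)
    g2 = np.asarray(g2, dtype=float)
    out1, out2 = g1.copy(), g2.copy()
    if np.dot(g1, g2) < 0:  # gradients point in competing directions
        out1 = g1 - (np.dot(g1, g2) / np.dot(g2, g2)) * g2
        out2 = g2 - (np.dot(g2, g1) / np.dot(g1, g1)) * g1
    return out1 + out2

# Conflicting case: the opposing x-components are projected away,
# so neither task's update is dominated by the other.
g = project_conflicting([1.0, 0.0], [-1.0, 1.0])
```

Non-conflicting gradients (non-negative dot product) pass through unchanged and are simply summed, so the surgery only intervenes when the losses actually compete.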
