Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models

Neural Information Processing Systems 

Disentangled representation learning (DRL) aims to identify and decompose the underlying factors behind observations, thereby facilitating data perception and generation.