Dirichlet_Graph_Variational_Autoencoder_V3.pdf

yurong

Neural Information Processing Systems 

Hence, the regularization is used to maximize the sample variance. To simplify the notation (i.e., ignore the constant), As the slater condition holds, KKT condition is necessary and sufficient.