Understanding Posterior Collapse in Generative Latent Variable Models

less than 1 minute read

Published:

postetior collapse

  • variational distribution matches the prior for some dimentions
  • previous work: the KL term causes it
  • this paper: spurious local maxima in training objective
    • study from linear VAE: marginal distribution encourages postetior collapse
      • rotation matrix accounts for weak identifiability in the model
    • variance of the observation model impacts the stability of local stationary points
      • large sigma tends towards a stable local maxima
      • ELBO does not introduce any additional spurious local maxima