Approximate Bayesian Inference for Science

Linear models and their many extensions (linear mixed models, GLMs, matrix factorization) have several advantages. First, reliable inference procedures, sometimes with convergence guarantees and accompanying software, are widely available for these models. Second, these simple models are interpretable, in the sense that one can identify which covariates are most strongly associated with a given target. Often, there are also direct connections to hypothesis testing.
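As a minimal sketch of this interpretability, the example below fits an ordinary least-squares model with statsmodels and reads off per-covariate coefficients and p-values; the synthetic data and variable names are illustrative assumptions, not drawn from the manuscripts themselves.

```python
# A minimal sketch of linear-model interpretability, assuming numpy and
# statsmodels are installed; the synthetic data is purely illustrative.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 200

# Two covariates: only the first truly drives the target.
X = rng.normal(size=(n, 2))
y = 2.0 * X[:, 0] + rng.normal(scale=0.5, size=n)

# Fit ordinary least squares with an intercept term.
design = sm.add_constant(X)
fit = sm.OLS(y, design).fit()

# Each coefficient has a direct interpretation as a covariate effect, and
# each comes with a standard error and p-value, i.e. a built-in hypothesis test.
print(fit.params)    # estimated effect of each covariate
print(fit.pvalues)   # per-coefficient significance tests
```

Deep generative models generally do not expose their parameters in such a directly interpretable form, which motivates the problems studied below.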
To go beyond conjugacy and linearity, users of deep generative models sometimes sacrifice some of these benefits. In these manuscripts, we study two practical problems that arose with the use of variational autoencoders: disentanglement of latent variables and hypothesis testing (in the broader context of decision-making). We also review recent advances and challenges in applying deep generative models to scientific discovery in molecular biology.