Deep Generative Models for Single-cell Transcriptomics

Single-cell Variational Inference (scVI) is a set of tools for probabilistic analysis of single-cell RNA-sequencing data. We propose to solve quantitative biological questions with tailored and scalable Bayesian machinery. Examples of biological questions include cell-type discovery, data integration, transfer of annotations and differential expression analysis. Examples of statistical tools include deep generative models, stochastic variational inference, Bayesian decision theory and domain adaptation.
All the models in our codebase may fit millions of transcriptomic measurements in a few hours. In addition to the original publication, we recommend the following ressources to get more familiar with scVI:
- our codebase and documentation which includes user-friendly notebooks
- the podcast recorded on the bioinformatics chat
- the YosefLab blog usually includes discussions about special topics
- the News & View in Nature Methods
- our post on the Berkeley AI Research blog
Publications
Joint Probabilistic Modeling of Paired Transcriptome and Proteome Measurements in Single Cells
The paired measurement of RNA and surface protein abundance in single cells with CITE-seq is a promising approach to connect …
Probabilistic Harmonization and Annotation of Single-cell Transcriptomics Data with Deep Generative Models
As single-cell transcriptomics becomes a mainstream technology, the natural next step is to integrate the accumulating data in order to …
A Joint Model of RNA Expression and Surface Protein Abundance in Single Cells
Cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq) combines unbiased single-cell transcriptome measurements with …
Deep Generative Models for Detecting Differential Expression in Single Cells
Detecting differentially expressed genes is important for characterizing subpopulations of cells. However, in scRNA-seq data, nuisance …
Detecting Zero-Inflated Genes in Single-Cell Transcriptomics Data
In single-cell RNA sequencing data, biological processes or technical factors may induce an overabundance of zero measurements. …
A Joint Model of Unpaired Data from scRNA-seq and Spatial Transcriptomics for Imputing Missing Gene Expression Measurements
Spatial studies of transcriptome provide biologists with gene expression maps of heterogeneous and complex tissues. However, most …
Deep Generative Modeling for Single-cell Transcriptomics
Single-cell transcriptome measurements can reveal unexplored biological diversity, but they suffer from technical noise and bias that …
A Deep Generative Model for Gene Expression profiles from Single-cell RNA Sequencing with Application to Differential Expression
We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing. In the …