login
Home / Papers / Statistics or biology: the zero-inflation controversy about scRNA-seq data

Statistics or biology: the zero-inflation controversy about scRNA-seq data

615 Citations2022
Ruochen Jiang, Tianyi Sun, Dongyuan Song

The sources and impacts of non-biological zeros in single-cell RNA-seq data differently are discussed and the importance of transparent analysis is advocated, to help address the controversy.

Abstract

Researchers view vast zeros in single-cell RNA-seq data differently: some regard zeros as biological signals representing no or low gene expression, while others regard zeros as missing data to be corrected. To help address the controversy, here we discuss the sources of biological and non-biological zeros; introduce five mechanisms of adding non-biological zeros in computational benchmarking; evaluate the impacts of non-biological zeros on data analysis; benchmark three input data types: observed counts, imputed counts, and binarized counts; discuss the open questions regarding non-biological zeros; and advocate the importance of transparent analysis.