Skip to main content
Journal cover image

RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data.

Publication ,  Journal Article
Schmidt, F; Ranjan, B; Lin, QXX; Krishnan, V; Joanito, I; Honardoost, MA; Nawaz, Z; Venkatesh, PN; Tan, J; Rayan, NA; Ong, ST; Prabhakar, S
Published in: Nucleic Acids Res
September 7, 2021

The transcriptomic diversity of cell types in the human body can be analysed in unprecedented detail using single cell (SC) technologies. Unsupervised clustering of SC transcriptomes, which is the default technique for defining cell types, is prone to group cells by technical, rather than biological, variation. Compared to de-novo (unsupervised) clustering, we demonstrate using multiple benchmarks that supervised clustering, which uses reference transcriptomes as a guide, is robust to batch effects and data quality artifacts. Here, we present RCA2, the first algorithm to combine reference projection (batch effect robustness) with graph-based clustering (scalability). In addition, RCA2 provides a user-friendly framework incorporating multiple commonly used downstream analysis modules. RCA2 also provides new reference panels for human and mouse and supports generation of custom panels. Furthermore, RCA2 facilitates cell type-specific QC, which is essential for accurate clustering of data from heterogeneous tissues. We demonstrate the advantages of RCA2 on SC data from human bone marrow, healthy PBMCs and PBMCs from COVID-19 patients. Scalable supervised clustering methods such as RCA2 will facilitate unified analysis of cohort-scale SC datasets.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Nucleic Acids Res

DOI

EISSN

1362-4962

Publication Date

September 7, 2021

Volume

49

Issue

15

Start / End Page

8505 / 8519

Location

England

Related Subject Headings

  • Transcriptome
  • Single-Cell Analysis
  • RNA-Seq
  • RNA, Small Cytoplasmic
  • Quality Control
  • Organ Specificity
  • Mice
  • Leukocytes, Mononuclear
  • Humans
  • Developmental Biology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Schmidt, F., Ranjan, B., Lin, Q. X. X., Krishnan, V., Joanito, I., Honardoost, M. A., … Prabhakar, S. (2021). RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data. Nucleic Acids Res, 49(15), 8505–8519. https://doi.org/10.1093/nar/gkab632
Schmidt, Florian, Bobby Ranjan, Quy Xiao Xuan Lin, Vaidehi Krishnan, Ignasius Joanito, Mohammad Amin Honardoost, Zahid Nawaz, et al. “RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data.Nucleic Acids Res 49, no. 15 (September 7, 2021): 8505–19. https://doi.org/10.1093/nar/gkab632.
Schmidt F, Ranjan B, Lin QXX, Krishnan V, Joanito I, Honardoost MA, et al. RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data. Nucleic Acids Res. 2021 Sep 7;49(15):8505–19.
Schmidt, Florian, et al. “RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data.Nucleic Acids Res, vol. 49, no. 15, Sept. 2021, pp. 8505–19. Pubmed, doi:10.1093/nar/gkab632.
Schmidt F, Ranjan B, Lin QXX, Krishnan V, Joanito I, Honardoost MA, Nawaz Z, Venkatesh PN, Tan J, Rayan NA, Ong ST, Prabhakar S. RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data. Nucleic Acids Res. 2021 Sep 7;49(15):8505–8519.
Journal cover image

Published In

Nucleic Acids Res

DOI

EISSN

1362-4962

Publication Date

September 7, 2021

Volume

49

Issue

15

Start / End Page

8505 / 8519

Location

England

Related Subject Headings

  • Transcriptome
  • Single-Cell Analysis
  • RNA-Seq
  • RNA, Small Cytoplasmic
  • Quality Control
  • Organ Specificity
  • Mice
  • Leukocytes, Mononuclear
  • Humans
  • Developmental Biology