scGMAI: A Gaussian mixture model for clustering single-cell RNA-Seq data based on deep autoencoder

  • Bin Yu*
  • Chen Chen
  • Ren Qi
  • Ruiqing Zheng
  • Patrick J. Skillman-Lawrence
  • Xiaolin Wang
  • Anjun Ma
  • Haiming Gu

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

57 Citations (Scopus)

Abstract

The rapid development of single-cell RNA sequencing (scRNA-Seq) technology provides strong technical support for accurately and efficiently analyzing single-cell gene expression data. However, scRNA-Seq analysis faces many obstacles, including dropout events and the curse of dimensionality. Here, we propose scGMAI, a new single-cell Gaussian mixture clustering method based on autoencoder networks and fast independent component analysis (FastICA). Specifically, scGMAI uses autoencoder networks to reconstruct gene expression values from scRNA-Seq data, and FastICA to reduce the dimensionality of the reconstructed data. By integrating these techniques, scGMAI outperforms existing tools, including Seurat, in clustering cells from 17 public scRNA-Seq datasets. In summary, scGMAI is an effective tool for accurately clustering and identifying cell types from scRNA-Seq data and shows great potential for scRNA-Seq data analysis. The source code is available at https://github.com/QUST-AIBBDRC/scGMAI/.
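
The three stages named in the abstract (autoencoder reconstruction, FastICA dimensionality reduction, Gaussian mixture clustering) can be sketched roughly as follows. This is an illustration only, not the authors' implementation: scikit-learn's `MLPRegressor`, trained to reproduce its own input, stands in for the autoencoder networks, and the function name `cluster_cells`, the layer sizes, the number of components, and the synthetic count matrix are all assumptions made for the example. The repository linked above holds the actual code.

```python
import numpy as np
from sklearn.decomposition import FastICA
from sklearn.mixture import GaussianMixture
from sklearn.neural_network import MLPRegressor


def cluster_cells(counts, n_clusters, n_components=5, seed=0):
    """Hypothetical sketch: reconstruct, reduce, then cluster a cells x genes matrix."""
    # Log-transform raw counts to dampen the effect of highly expressed genes.
    x = np.log1p(np.asarray(counts, dtype=float))

    # Stand-in autoencoder (assumption): an MLP with a narrow middle layer,
    # trained to map the expression matrix back onto itself.
    autoencoder = MLPRegressor(
        hidden_layer_sizes=(32, 8, 32),
        activation="tanh",
        max_iter=300,
        random_state=seed,
    )
    autoencoder.fit(x, x)
    reconstructed = autoencoder.predict(x)

    # Reduce the reconstructed expression values with FastICA.
    ica = FastICA(n_components=n_components, max_iter=1000, random_state=seed)
    reduced = ica.fit_transform(reconstructed)

    # Cluster cells in the reduced space with a Gaussian mixture model.
    gmm = GaussianMixture(n_components=n_clusters, random_state=seed)
    return gmm.fit_predict(reduced)


# Synthetic data (assumption): 3 groups of 40 cells, 50 genes, Poisson counts.
rng = np.random.default_rng(0)
group_means = rng.uniform(0.5, 8.0, size=(3, 50))
counts = np.vstack([rng.poisson(mean, size=(40, 50)) for mean in group_means])

labels = cluster_cells(counts, n_clusters=3)
```

Here `labels` holds one cluster index per cell. In practice the number of clusters and of independent components would need to be chosen per dataset; the values above are placeholders.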

Original language: English
Article number: bbaa316
Journal: Briefings in Bioinformatics
Volume: 22
Issue number: 4
DOIs
Publication status: Published - 1 Jul 2021
Externally published: Yes

Keywords

  • autoencoder networks
  • cell clustering
  • fast independent component analysis
  • Gaussian mixture model
  • scRNA-Seq
