scBKAP: a clustering model for single-cell RNA-seq data based on bisecting K-means

Xiaolin Wang, Hongli Gao, Ren Qi, Ruiqing Zheng, Xin Gao, Bin Yu

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Advances in single-cell RNA sequencing (scRNA-seq) technologies allow researchers to analyze the genome-wide transcription profile and to solve biological problems at the individual-cell resolution. However, existing clustering methods on scRNA-seq suffer from high dropout rate and curse of dimensionality in the data. Here, we propose a novel pipeline, scBKAP, the cornerstone of which is a single-cell bisecting K-means clustering method based on an autoencoder network and a dimensionality reduction model MPDR. Specially, scBKAP utilizes an autoencoder network to reconstruct gene expression values from scRNA-seq data to alleviate the dropout issue, and the MPDR model composed of the M3Drop feature selection algorithm and the PHATE dimensionality reduction algorithm to reduce the dimensions of reconstructed data. The dimensionality-reduced data are then fed into the bisecting K-means clustering algorithm to identify the clusters of cells. Comprehensive experiments demonstrate scBKAP's superior performance over nine state-of-the-art single-cell clustering methods on 21 public scRNA-seq datasets and simulated datasets.
Original languageEnglish (US)
Pages (from-to)1-10
Number of pages10
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
DOIs
StatePublished - Dec 19 2022

Fingerprint

Dive into the research topics of 'scBKAP: a clustering model for single-cell RNA-seq data based on bisecting K-means'. Together they form a unique fingerprint.

Cite this