This dataset was used to evaluate the removal of batch effects induced by using different scRNA-seq technologies on a big dataset. bioRxiv.
Jolliffe I. Returning to the analysis with t-SNE and UMAP visualizations, we can concur with the rankings. Among the methods that ranked in the top three, LIGER and Seurat 3 ranked top for one dataset each. Based on the MNN tutorial, 5000 highly HVGs were identified and used as input to the mnnCorrect function for batch correction. Methods with higher kBET acceptance rates are the better performing methods. We ran the assessment on our server with 1 TB of RAM and recorded the memory usage every 5 s. We then visualized the memory usage in the form of violin plots (Fig. This approach is based on the assumption that the difference between the distributions of the two datasets is moderate. Available from: http://papers.nips.cc/paper/5423-generative-adversarial-nets.pdf. Following the example in the kBET paper, we chose the k input value equal to 5%, 10%, 15%, 20%, and 25% of the sample size and ran kBET to get the median of all kBET rejection rates to produce the final kBET result for each method. Tran HTN, Ang KS, Chevrier M, Zhang X, Lee NYS, Goh M, Chen J. Scanorama, ZINB-WaVE, and scMerge not only mixed the CD4 and CD8 cells, but also accomplished poorer overall batch mixing. Their respective batch labels were compared to the k-means clustering labels, and a low ARI score denotes superior mixing. JC conceptualized and supervised the study. A small proportion of local distributions deviating from the global batch label ratio (i.e., rejection rate) denotes good batch mixing. Scenario 1 consisted of dataset 2 of murine tissues, and dataset 5 of human perpherial blood mononuclear cells (PBMCs). This was followed by ZINB-WaVE and scMerge with slightly lower TP. With the identified clusters, the factor loading quantiles are then normalized to match a chosen reference dataset (typically the set with the largest number of cells), thus accomplishing batch correction. PubMed Central However, with a greater imbalance in cell numbers (simulations 5 and 6, with 80 cells in batch 1 and 400 cells in batch 2), the number of TP was instead significantly lower compared to other simulations. Mapping the mouse cell atlas by Microwell-seq. What do you think about the feature selection part in single-cell RNA analysis? Quantitative evaluation of 14 batch-effect correction methods using the four assessment metrics a ASW, b ARI, c LISI, and d kBET on dataset 10 of mouse hematopoietic stem and progenitor cells. Accessed 1 Mar 2019. TP, FP, FN, TN, precision, and F-score were computed for each simulated dataset, and the median F-score over 6 simulated datasets was used to rank the batch correction methods. However, it is also unfortunate that method such as Seurat 3 was less successful than older and less sophisticated methods when handling such well-behaved input. Can I manually add nCount_RNA & nFeature_RNA to a converted Seurat object? What are the relative advantages and disadvantages of the two approaches?
The last scenario explores the impact of batch correction on differentially expressed genes. from http://dropviz.org/ and extracted the Digital Gene Expression (DGE) matrices of cells from the .raw.dge.txt.gz files found under the “DGE By Region” section. I am performing combined analysis of three scRNA-seq samples from different donors generated using 10x Genomics technology. While it is a challenge to combine different types of assessment methods into a unique evaluation score index, a single integrated index that accounts for both batch mixing and cell type mixing will be better to evaluate the batch-corrected output. To scale the median scores, we used the respective maximum and minimum scores. A popular and successful approach, pioneered by Haghverdi et al. 2). The ARI measures the percentage of matches between two label lists, corrected for chance. Or It will be great if you can share a public github link that was useful for you or any notebook that you have prepared yourself? Wolf FA, Angerer P, Theis FJ. False negatives (FN) is the number of genes which are true DEGs but detected as non-significant. In this study, we used the latest version of BBKNN (version 1.3.2) to perform the analysis.
We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Because the number of nearest neighbors k as input has a big impact on the results of kBET, we ran kBET using a predefined list of k values. Methods with higher kBET acceptance rates are the better performing methods.
Qualitative evaluation of 14 batch-effect correction methods using UMAP visualization for dataset 4 of human pancreatic cells. Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding. I have 3 patients with normal and tumor tissue sample(10× technology), there are six samples... Hi guys, If the fraction of rejections is close to zero, this signifies that the batches are well mixed. Further MNN algorithm development. The representation is composed of two parts: a set of batch-specific factors and a set of shared factors. 20 and Additional file 7: Table S6, with the same batch size and different drop-out rate (case 1 compared to 2, and case 3 compared to 4), the TP and FP counts were on average similar between the case of a high drop-out rate compared to a low drop-out rate, except for scGen.
In the second step, Harmony computes a global centroid for each cluster and a centroid for each specific dataset. I have a question about how to decide on the clustering resolution to use for single-cell RN... Dear all, Ten out of 14 methods (BBKNN, ComBat, Harmony, LIGER, limma, MMD-ResNet, Scanorama, scGen, Seurat 3, and ZINB-WaVE) were able to complete runs on datasets 8 and 9, while the remainder did not complete due to insufficient memory or excessively long runtime. The cell numbers are unevenly distributed across cell types, and the bulk of the cells in batch 2 consist of astrocytes, neurons, oligodendrocytes, and polydendrocytes. b Memory usage of ten methods on dataset 8. c Runtime of 14 methods on ten datasets. Springer Nature. In particular, Scanorama’s F-score was lower than the raw, implying that the method removed most of the cell type variation between “Group 1” and “Group 2.” This is a crucial point, as the goal of batch correction is to remove variations due to data acquisition under different conditions and technologies, while preserving variations of biological origin.