![](https://www.cnag.eu/sites/default/files/styles/image_news/public/PHOTO%20FOR%20NEWS%20%28WEB%29-30.jpg?itok=9zErjiye)
• Researchers from the Functional Genomics Team at CNAG have identified critical limitations in standard quality control (QC) workflows for single-cell RNA sequencing (scRNA-seq), leading to the frequent inclusion of low-quality cells in single-cell transcriptomic atlases.
These poor-quality cells arise from technical artifacts such as damaged cells, incomplete cell lysis, and contamination from cytoplasmic debris. Despite this, standard QC measures, which focus on parameters such as the mitochondrial content and number of detected genes/transcripts per cell, fail to evaluate the presence of the nucleus. Moreover, these current methods have proven to be insufficient for challenging tissues and organs such as the heart, liver, or kidney. The process of obtaining high-quality cells from these tissues is particularly difficult, as the dissociation treatments are quite aggressive, damaging some cell types and releasing high amounts of RNA into solution, which then become ambient RNA and confound the analysis.
The CNAG researchers propose advocating for the routine inclusion of nuclear metrics, specifically the intronic fraction and MALAT1 expression. Null or very low levels of intronic reads or MALAT1 indicate that the nucleus has not been lysed, and only the cytosolic transcriptome has been sequenced. To demonstrate the high presence of poor-quality cells in publicly cellular atlases, the team have reanalised 500 samples from more than 150 cell types across six cell atlases, covering more than 20 human and mouse tissues.
The results shed light on the significant presence of low-quality cells in most datasets, with some cases severely compromising organs such as the kidney. The team demonstrated that filtering cells with absent nuclear markers effectively excludes poor-quality cells. “Our findings show that many datasets rely on permissive QC workflows that fail to detect nuclear content deficiencies,” explains Tomàs Montserrat, bioinformatician at CNAG and first author of the study. “This oversight is particularly critical for tissues that require aggressive dissociation protocols. We strongly recommend incorporating nuclear markers into the QC step as a best practice for single-cell analysis.”
The implications of these findings extend beyond data quality. “Reusing reference datasets with low-quality cells risks propagating errors into new analyses,” adds Anna Esteve-Codina, the study’s senior author. “Adopting stricter QC standards will enhance the reliability of cellular maps, benefiting the entire research community.”
Nuclear-content QC metrics can be applied in any single-cell RNA-seq study easily. Both nuclear metrics are compatible with existing bioinformatics tools, providing a scalable solution for improving new single-cell datasets and review those already published. However, the incorporation of these two nuclear metrics is still underused in the single-cell research community. CNAG researchers are reaching out to leading institutions in the field to present the results of this study. Indeed, the advancement not only paves the way for higher-quality cellular atlases but also underscores the importance of refining experimental and computational protocols to meet the growing demands of single-cell research.
Reference article
Montserrat-Ayuso, Tomàs, and Anna Esteve-Codina. ‘High Content of Nuclei-Free Low-Quality Cells in Reference Single-Cell Atlases: A Call for More Stringent Quality Control Using Nuclear Fraction’. BMC Genomics, vol. 25, no. 1, Nov. 2024, p. 1124. BioMed Central, https://doi.org/10.1186/s12864-024-11015-5.