Precise reconstruction of the TME using bulk RNA-seq and a machine learning algorithm trained on artificial transcriptomes.

TitlePrecise reconstruction of the TME using bulk RNA-seq and a machine learning algorithm trained on artificial transcriptomes.
Publication TypeJournal Article
Year of Publication2022
AuthorsZaitsev A, Chelushkin M, Dyikanov D, Cheremushkin I, Shpak B, Nomie K, Zyrin V, Nuzhdina E, Lozinsky Y, Zotova A, Degryse S, Kotlov N, Baisangurov A, Shatsky V, Afenteva D, Kuznetsov A, Paul SRaju, Davies DL, Reeves PM, Lanuti M, Goldberg MF, Tazearslan C, Chasse M, Wang I, Abdou M, Aslanian SM, Andrewes S, Hsieh JJ, Ramachandran A, Lyu Y, Galkin I, Svekolkin V, Cerchietti L, Poznansky MC, Ataullakhanov R, Fowler N, Bagaev A
JournalCancer Cell
Volume40
Issue8
Pagination879-894.e16
Date Published2022 Aug 08
ISSN1878-3686
KeywordsAlgorithms, CD8-Positive T-Lymphocytes, Humans, Machine Learning, Neoplasms, RNA-Seq, Sequence Analysis, RNA, Transcriptome, Tumor Microenvironment
Abstract

Cellular deconvolution algorithms virtually reconstruct tissue composition by analyzing the gene expression of complex tissues. We present the decision tree machine learning algorithm, Kassandra, trained on a broad collection of >9,400 tissue and blood sorted cell RNA profiles incorporated into millions of artificial transcriptomes to accurately reconstruct the tumor microenvironment (TME). Bioinformatics correction for technical and biological variability, aberrant cancer cell expression inclusion, and accurate quantification and normalization of transcript expression increased Kassandra stability and robustness. Performance was validated on 4,000 H&E slides and 1,000 tissues by comparison with cytometric, immunohistochemical, or single-cell RNA-seq measurements. Kassandra accurately deconvolved TME elements, showing the role of these populations in tumor pathogenesis and other biological processes. Digital TME reconstruction revealed that the presence of PD-1-positive CD8+ T cells strongly correlated with immunotherapy response and increased the predictive potential of established biomarkers, indicating that Kassandra could potentially be utilized in future clinical applications.

DOI10.1016/j.ccell.2022.07.006
Alternate JournalCancer Cell
PubMed ID35944503