Motivation: Single-cell RNA sequencing allows high-resolution views of individual cells for libraries of up to millions of samples, thus motivating the use of deep learning for analysis. In this study, we introduce the use of graph neural networks for the unsupervised exploration of scRNA-seq data by developing a variational graph autoencoder architecture with graph attention layers that operates directly on the connectivity between cells, focusing on dimensionality reduction and clustering. With the help of several case studies, we show that our model, named CellVGAE, can be effectively used for exploratory analysis even on challenging datasets, by extracting meaningful features from the data and providing the means to visualize and interpret different aspects of the model. Results: We show that CellVGAE is more interpretable than existing scRNA-seq variational architectures by analysing the graph attention coefficients. By drawing parallels with other scRNA-seq studies on interpretability, we assess the validity of the relationships modelled by attention, and furthermore, we show that CellVGAE can intrinsically capture information such as pseudotime and NF-KB activation dynamics, the latter being a property that is not generally shared by existing neural alternatives. We then evaluate the dimensionality reduction and clustering performance on 9 difficult and well-annotated datasets by comparing with three leading neural and non-neural techniques, concluding that CellVGAE outperforms competing methods. Finally, we report a decrease in training times of up to × 20 on a dataset of 1.3 million cells compared to existing deep learning architectures. Availabilityand implementation: The CellVGAE code is available at https://github.com/davidbuterez/CellVGAE.

CellVGAE: An unsupervised scRNA-seq analysis workflow with graph attention networks / Buterez, D.; Bica, I.; Tariq, I.; Andres-Terre, H.; Lio, P.. - In: BIOINFORMATICS. - ISSN 1367-4803. - 38:5(2022), pp. 1277-1286. [10.1093/bioinformatics/btab804]

CellVGAE: An unsupervised scRNA-seq analysis workflow with graph attention networks

Lio P.
2022

Abstract

Motivation: Single-cell RNA sequencing allows high-resolution views of individual cells for libraries of up to millions of samples, thus motivating the use of deep learning for analysis. In this study, we introduce the use of graph neural networks for the unsupervised exploration of scRNA-seq data by developing a variational graph autoencoder architecture with graph attention layers that operates directly on the connectivity between cells, focusing on dimensionality reduction and clustering. With the help of several case studies, we show that our model, named CellVGAE, can be effectively used for exploratory analysis even on challenging datasets, by extracting meaningful features from the data and providing the means to visualize and interpret different aspects of the model. Results: We show that CellVGAE is more interpretable than existing scRNA-seq variational architectures by analysing the graph attention coefficients. By drawing parallels with other scRNA-seq studies on interpretability, we assess the validity of the relationships modelled by attention, and furthermore, we show that CellVGAE can intrinsically capture information such as pseudotime and NF-KB activation dynamics, the latter being a property that is not generally shared by existing neural alternatives. We then evaluate the dimensionality reduction and clustering performance on 9 difficult and well-annotated datasets by comparing with three leading neural and non-neural techniques, concluding that CellVGAE outperforms competing methods. Finally, we report a decrease in training times of up to × 20 on a dataset of 1.3 million cells compared to existing deep learning architectures. Availabilityand implementation: The CellVGAE code is available at https://github.com/davidbuterez/CellVGAE.
2022
Cluster Analysis; Gene Expression Profiling; Sequence Analysis, RNA; Single-Cell Analysis; Single-Cell Gene Expression Analysis; Workflow
01 Pubblicazione su rivista::01a Articolo in rivista
CellVGAE: An unsupervised scRNA-seq analysis workflow with graph attention networks / Buterez, D.; Bica, I.; Tariq, I.; Andres-Terre, H.; Lio, P.. - In: BIOINFORMATICS. - ISSN 1367-4803. - 38:5(2022), pp. 1277-1286. [10.1093/bioinformatics/btab804]
File allegati a questo prodotto
File Dimensione Formato  
Buterez_CellVGAE_2022.pdf

accesso aperto

Note: DOI 10.1093/bioinformatics/btab804
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 2.8 MB
Formato Adobe PDF
2.8 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1719137
Citazioni
  • ???jsp.display-item.citation.pmc??? 10
  • Scopus 19
  • ???jsp.display-item.citation.isi??? 14
social impact