Using machine learning techniques to build representations from biomedical data can help us understand the latent biological mechanism of action and lead to important discoveries. Recent developments in single-cell RNA-sequencing protocols have allowed measuring gene expression for individual cells in a population, thus opening up the possibility of finding answers to biomedical questions about cell differentiation. In this paper, we explore unsupervised generative neural methods, based on the variational autoencoder, that can model cell differentiation by building meaningful representations from the high dimensional and complex gene expression data. We use disentanglement methods based on information theory to improve the data representation and achieve better separation of the biological factors of variation in the gene expression data. In addition, we use a graph autoencoder consisting of graph convolutional layers to predict relationships between single-cells. Based on these models, we develop a computational framework that consists of methods for identifying the cell types in the dataset, finding driver genes for the differentiation process and obtaining a better understanding of relationships between cells. We illustrate our methods on datasets from multiple species and also from different sequencing technologies.

Unsupervised generative and graph representation learning for modelling cell differentiation / Bica, I.; Andres-Terre, H.; Cvejic, A.; Lio, P.. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 10:1(2020). [10.1038/s41598-020-66166-8]

Unsupervised generative and graph representation learning for modelling cell differentiation

Lio P.
2020

Abstract

Using machine learning techniques to build representations from biomedical data can help us understand the latent biological mechanism of action and lead to important discoveries. Recent developments in single-cell RNA-sequencing protocols have allowed measuring gene expression for individual cells in a population, thus opening up the possibility of finding answers to biomedical questions about cell differentiation. In this paper, we explore unsupervised generative neural methods, based on the variational autoencoder, that can model cell differentiation by building meaningful representations from the high dimensional and complex gene expression data. We use disentanglement methods based on information theory to improve the data representation and achieve better separation of the biological factors of variation in the gene expression data. In addition, we use a graph autoencoder consisting of graph convolutional layers to predict relationships between single-cells. Based on these models, we develop a computational framework that consists of methods for identifying the cell types in the dataset, finding driver genes for the differentiation process and obtaining a better understanding of relationships between cells. We illustrate our methods on datasets from multiple species and also from different sequencing technologies.
2020
RNA Sequence; Single-Cell Analysis; Gene Expression Profiling
01 Pubblicazione su rivista::01a Articolo in rivista
Unsupervised generative and graph representation learning for modelling cell differentiation / Bica, I.; Andres-Terre, H.; Cvejic, A.; Lio, P.. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 10:1(2020). [10.1038/s41598-020-66166-8]
File allegati a questo prodotto
File Dimensione Formato  
Bica_Unsupervised_2021.pdf

accesso aperto

Note: RNA Sequence; Single-Cell Analysis; Gene Expression Profiling
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 4.48 MB
Formato Adobe PDF
4.48 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1719686
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 9
social impact