Current methods for the identification of putatively co-regulated genes directly from gene expression time profiles are based on the similarity of the time profile. Such association metrics, despite their central role in gene network inference and machine learning, have largely ignored the impact of dynamics or variation in mRNA stability. Here we introduce a simple, but powerful, new similarity metric called lead-lag R2 that successfully accounts for the properties of gene dynamics, including varying mRNA degradation and delays. Using yeast cell-cycle time-series gene expression data, we demonstrate that the predictive power of lead-lag R2 for the identification of co-regulated genes is significantly higher than that of standard similarity measures, thus allowing the selection of a large number of entirely new putatively co-regulated genes. Furthermore, the lead-lag metric can also be used to uncover the relationship between gene expression time-series and the dynamics of formation of multiple protein complexes. Remarkably, we found a high lead-lag R2 value among genes coding for a transient complex.

Embedding mRNA stability in correlation analysis of time-series gene expression data / Farina, Lorenzo; DE SANTIS, Alberto; Salvucci, S; Morelli, G; Ruberti, I.. - In: PLOS COMPUTATIONAL BIOLOGY. - ISSN 1553-7358. - 4(8):(2008), pp. 1-12. [10.1371/journal.pcbi.1000141]

Embedding mRNA stability in correlation analysis of time-series gene expression data

FARINA, Lorenzo;DE SANTIS, Alberto;
2008

Abstract

Current methods for the identification of putatively co-regulated genes directly from gene expression time profiles are based on the similarity of the time profile. Such association metrics, despite their central role in gene network inference and machine learning, have largely ignored the impact of dynamics or variation in mRNA stability. Here we introduce a simple, but powerful, new similarity metric called lead-lag R2 that successfully accounts for the properties of gene dynamics, including varying mRNA degradation and delays. Using yeast cell-cycle time-series gene expression data, we demonstrate that the predictive power of lead-lag R2 for the identification of co-regulated genes is significantly higher than that of standard similarity measures, thus allowing the selection of a large number of entirely new putatively co-regulated genes. Furthermore, the lead-lag metric can also be used to uncover the relationship between gene expression time-series and the dynamics of formation of multiple protein complexes. Remarkably, we found a high lead-lag R2 value among genes coding for a transient complex.
2008
01 Pubblicazione su rivista::01a Articolo in rivista
Embedding mRNA stability in correlation analysis of time-series gene expression data / Farina, Lorenzo; DE SANTIS, Alberto; Salvucci, S; Morelli, G; Ruberti, I.. - In: PLOS COMPUTATIONAL BIOLOGY. - ISSN 1553-7358. - 4(8):(2008), pp. 1-12. [10.1371/journal.pcbi.1000141]
File allegati a questo prodotto
File Dimensione Formato  
VE_2008_11573-228575.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 725.36 kB
Formato Adobe PDF
725.36 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/228575
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 12
  • ???jsp.display-item.citation.isi??? 11
social impact