The problem of detecting a binding site - a substring of DNA where transcription factors attach - on a long DNA sequence requires the recognition of a small pattern in a large background. For short binding sites, the matching probability can display large fluctuations from one putative binding site to another. Here we use a self-consistent statistical procedure that accounts correctly for the large deviations of the matching probability to predict the location of short binding sites. We apply it in two distinct situations: a) the detection of the binding sites for three specific transcription factors on a set of 134 estrogen-regulated genes; b) the identification, in a set of 138 possible transcription factors, of the ones binding a specific set of nine genes. In both instances, experimental findings are reproduced (when available) and the number of false positives is significantly reduced with respect to the other methods commonly employed. Copyright © 2008 EPLA.

Identifying short motifs by means of extreme value analysis / Bianchi, D.; Tirozzi, Benedetto. - In: EUROPHYSICS LETTERS. - ISSN 0295-5075. - 84:1(2008), pp. 18001-18006. [10.1209/0295-5075/84/18001]

Identifying short motifs by means of extreme value analysis

TIROZZI, Benedetto
2008

01 Journal publication::01a Journal article

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/30796