An improved method for detection of words with unusual occurrence frequency in nucleotidic sequences

Colosimo, Alfredo; Morante, S.; Parisi, V.; Rossi, G. C.

doi:10.1006/jtbi.1993.1212

A statistical analysis designed to deal with the problem of identifying rare or abundant "words" of arbitrary length in genomic fragments is presented. Our approach has the novelty of taking into account the statistical role of the presence of shorter words nested into longer ones and of introducing a Bayesian correction to minimize the effects of statistical fluctuations and of possible mistakes in genomic data. The method is successfully used in a thorough analysis of the abundance of short nucleotide sequences in the Escherichia coli genome.

An improved method for detection of words with unusual occurrence frequency in nucleotidic sequences / Colosimo, A., S., M., G. C., R.. - In: JOURNAL OF THEORETICAL BIOLOGY. - ISSN 0022-5193. - STAMPA. - 165:(1993), pp. 659-672. [10.1006/jtbi.1993.1212]

An improved method for detection of words with unusual occurrence frequency in nucleotidic sequences

COLOSIMO, Alfredo;S. Morante;V. Parisi;G. C. Rossi

1993

Abstract

A statistical analysis designed to deal with the problem of identifying rare or abundant "words" of arbitrary length in genomic fragments is presented. Our approach has the novelty of taking into account the statistical role of the presence of shorter words nested into longer ones and of introducing a Bayesian correction to minimize the effects of statistical fluctuations and of possible mistakes in genomic data. The method is successfully used in a thorough analysis of the abundance of short nucleotide sequences in the Escherichia coli genome.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				1993
			
	Parole chiave
	
				genome analysis; bayesian analysis; nucleotide sequences.
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				An improved method for detection of words with unusual occurrence frequency in nucleotidic sequences / Colosimo, A., S., M., G. C., R.. - In: JOURNAL OF THEORETICAL BIOLOGY. - ISSN 0022-5193. - STAMPA. - 165:(1993), pp. 659-672. [10.1006/jtbi.1993.1212]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/473209

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

3

8

7

Catalogo dei prodotti della ricerca