Genome sequencing of the human parasite Schistosoma mansoni revealed an interesting gene superfamily, called micro-exon gene (meg), that encodes secreted MEG proteins. The genes are composed of short exons (3-81 base pairs) regularly interspersed with long introns (up to 5 kbp). This article recollects 35 S. mansoni specific meg genes that are distributed over 7 autosomes and one pair of sex chromosomes and that code for at least 87 verified MEG proteins. We used various bioinformatics tools to produce an optimal alignment and propose a phylogenetic analysis. This work highlighted intriguing conserved patterns/motifs in the sequences of the highly variable MEG proteins. Based on the analyses, we were able to classify the verified MEG proteins into two subfamilies and to hypothesize their duplication and colonization of all the chromosomes. Together with motif identification, we also proposed to revisit MEGs' common names and annotation in order to avoid duplication, to help the reproducibility of research results and to avoid possible misunderstandings.
Revisiting Schistosoma mansoni Micro-Exon Gene (MEG) protein family: a tour into conserved motifs and annotation / Nedvědová, Štěpánka; De Stefano, Davide; Walker, Olivier; Hologne, Maggy; Miele, Adriana Erica. - In: BIOMOLECULES. - ISSN 2218-273X. - 13:9(2023). [10.3390/biom13091275]
Revisiting Schistosoma mansoni Micro-Exon Gene (MEG) protein family: a tour into conserved motifs and annotation
Adriana Erica Miele
Ultimo
Writing – Original Draft Preparation
2023
Abstract
Genome sequencing of the human parasite Schistosoma mansoni revealed an interesting gene superfamily, called micro-exon gene (meg), that encodes secreted MEG proteins. The genes are composed of short exons (3-81 base pairs) regularly interspersed with long introns (up to 5 kbp). This article recollects 35 S. mansoni specific meg genes that are distributed over 7 autosomes and one pair of sex chromosomes and that code for at least 87 verified MEG proteins. We used various bioinformatics tools to produce an optimal alignment and propose a phylogenetic analysis. This work highlighted intriguing conserved patterns/motifs in the sequences of the highly variable MEG proteins. Based on the analyses, we were able to classify the verified MEG proteins into two subfamilies and to hypothesize their duplication and colonization of all the chromosomes. Together with motif identification, we also proposed to revisit MEGs' common names and annotation in order to avoid duplication, to help the reproducibility of research results and to avoid possible misunderstandings.File | Dimensione | Formato | |
---|---|---|---|
Nedvedová_Revisiting_2023.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
2.39 MB
Formato
Adobe PDF
|
2.39 MB | Adobe PDF | |
Nedvedová_Revisiting_2023_suppl.pdf
accesso aperto
Tipologia:
Altro materiale allegato
Licenza:
Creative commons
Dimensione
203.78 kB
Formato
Adobe PDF
|
203.78 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.