A coordinated approach by public domain bioinformatics resources to aid the fight against Alzheimer's disease through expert curation of key protein targets

Breuza L; Arighi CN; Argoud-Puy G; Casals-Casas C; Estreicher A; Famiglietti ML; Georghiou G; Gos A; Gruaz-Gumowski N; Hinz U; Hyka-Nouspikel N; Kramarz B; Lovering RC; Lussi Y; Magrane M; Masson P; Perfetto L; Poux S; Rodriguez-Lopez M; Stoeckert C; Sundaram S; Wang LS; Wu EL; Orchard S

2020

Abstract

Background: The analysis and interpretation of data generated from patient-derived clinical samples relies on access to high-quality bioinformatics resources. These are maintained and updated by expert curators extracting knowledge from unstructured biological data described in free-text journal articles and converting this into more structured, computationally-accessible forms. This enables analyses such as functional enrichment of sets of genes/proteins using the Gene Ontology, and makes the searching of data more productive by managing issues such as gene/protein name synonyms, identifier mapping, and data quality.

Objective: To undertake a coordinated annotation update of key public-domain resources to better support Alzheimer's disease research.

Methods: We have systematically identified target proteins critical to disease process, in part by accessing informed input from the clinical research community.

Results: Data from 954 papers have been added to the UniProtKB, Gene Ontology, and the International Molecular Exchange Consortium (IMEx) databases, with 299 human proteins and 279 orthologs updated in UniProtKB. 745 binary interactions were added to the IMEx human molecular interaction dataset.

Conclusion: This represents a significant enhancement in the expert curated data pertinent to Alzheimer's disease available in a number of biomedical databases. Relevant protein entries have been updated in UniProtKB and concomitantly in the Gene Ontology. Molecular interaction networks have been significantly extended in the IMEx Consortium dataset and a set of reference protein complexes created. All the resources described are open-source and freely available to the research community and we provide examples of how these data could be exploited by researchers.

	Year of publication
		2020
	Keywords
		Alzheimer’s disease; Cytoscape network analysis; data curation; database; neurobiology; protein
	Type
		01 Journal publication::01a Journal article
	Citation
		A coordinated approach by public domain bioinformatics resources to aid the fight against Alzheimer's disease through expert curation of key protein targets / Breuza, L., Arighi, C.N., Argoud-Puy, G., Casals-Casas, C., Estreicher, A., Famiglietti, M.L., Georghiou, G., Gos, A., Gruaz-Gumowski, N., Hinz, U., Hyka-Nouspikel, N., Kramarz, B., Lovering, R.C., Lussi, Y., Magrane, M., Masson, P., Perfetto, L., Poux, S., Rodriguez-Lopez, M., Stoeckert, C., et al. - In: JOURNAL OF ALZHEIMER'S DISEASE. - ISSN 1387-2877. - 77:1(2020), pp. 257-273. [10.3233/JAD-200206]

Files attached to this product

File	Access	Type	License	Size	Format
Breuza_Coordinated_2020.pdf	Open access	Publisher's version (published version with the publisher's layout)	Creative Commons	2.08 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1660177
