Multilingual and cross-lingual Semantic Role Labeling (SRL) have recently garnered increasing attention as multilingual text representation techniques have become more effective and widely available. While recent work has attained growing success, results on gold multilingual benchmarks are still not easily comparable across languages, making it difficult to grasp where we stand. For example, in CoNLL-2009, the standard benchmark for multilingual SRL, language-to-language comparisons are affected by the fact that each language has its own dataset which differs from the others in size, domains, sets of labels and annotation guidelines. In this paper, we address this issue and propose UniteD-SRL, a new benchmark for multilingual and cross-lingual, span- and dependency-based SRL. UniteD-SRL provides expert-curated parallel annotations using a common predicate-argument structure inventory, allowing direct comparisons across languages and encouraging studies on cross-lingual transfer in SRL.

UniteD-SRL: A Unified Dataset for Span-and Dependency-Based Multilingual and Cross-Lingual Semantic Role Labeling / Tripodi, Rocco; Conia, Simone; Navigli, Roberto. - (2021), pp. 2293-2305. (Intervento presentato al convegno Empirical Methods in Natural Language Processing tenutosi a Punta Cana; Dominican Republic) [10.18653/v1/2021.findings-emnlp.197].

UniteD-SRL: A Unified Dataset for Span-and Dependency-Based Multilingual and Cross-Lingual Semantic Role Labeling

Simone Conia
Secondo
;
Roberto Navigli
Ultimo
2021

Abstract

Multilingual and cross-lingual Semantic Role Labeling (SRL) have recently garnered increasing attention as multilingual text representation techniques have become more effective and widely available. While recent work has attained growing success, results on gold multilingual benchmarks are still not easily comparable across languages, making it difficult to grasp where we stand. For example, in CoNLL-2009, the standard benchmark for multilingual SRL, language-to-language comparisons are affected by the fact that each language has its own dataset which differs from the others in size, domains, sets of labels and annotation guidelines. In this paper, we address this issue and propose UniteD-SRL, a new benchmark for multilingual and cross-lingual, span- and dependency-based SRL. UniteD-SRL provides expert-curated parallel annotations using a common predicate-argument structure inventory, allowing direct comparisons across languages and encouraging studies on cross-lingual transfer in SRL.
2021
Empirical Methods in Natural Language Processing
natural language processing; artificial intelligence; semantic role labeling; multilinguality; cross-linguality
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
UniteD-SRL: A Unified Dataset for Span-and Dependency-Based Multilingual and Cross-Lingual Semantic Role Labeling / Tripodi, Rocco; Conia, Simone; Navigli, Roberto. - (2021), pp. 2293-2305. (Intervento presentato al convegno Empirical Methods in Natural Language Processing tenutosi a Punta Cana; Dominican Republic) [10.18653/v1/2021.findings-emnlp.197].
File allegati a questo prodotto
File Dimensione Formato  
Tripodi_UNITED-SRL_2021.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 875.15 kB
Formato Adobe PDF
875.15 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1604119
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 0
social impact