Dependency-based Semantic Role Labeling (SRL) is bound to dependency parsing, as the arguments of a predicate are identified through the token that heads the dependency relation subtree of the argument span. However, dependency-based SRL corpora are susceptible to the dissociated nucleus problem: when a subclause’s semantic and structural cores are two separate words, the dependency tree chooses the structural token as the head of the subtree, coercing the SRL annotation into making the same choice. This leads to undesirable consequences: when directly using the output of a dependency-based SRL method in downstream tasks it is useful to work with the token representing the semantic core of a subclause, not the structural core. In this paper, we carry out a linguistically-driven investigation on the dissociated nucleus problem in dependency-based SRL and propose a novel algorithm that aligns predicate-argument structures to the syntactic structures from Universal Dependencies to select the semantic core of an argument. Our analysis shows that dissociated nuclei appear more often than one might expect, and that our novel algorithm greatly increases the richness of the semantic information in dependency-based SRL. We release the software to reproduce our experiments at https://github.com/SapienzaNLP/semdepalign.

Exploring the Dissociated Nucleus Phenomenon in Semantic Role Labeling / Bonomo, Tommaso; Conia, Simone; Navigli, Roberto. - (2024). (Intervento presentato al convegno Italian Conference on Computational Linguistics tenutosi a Pisa; Italy).

Exploring the Dissociated Nucleus Phenomenon in Semantic Role Labeling

Tommaso Bonomo
Primo
;
Simone Conia;Roberto Navigli
Ultimo
2024

Abstract

Dependency-based Semantic Role Labeling (SRL) is bound to dependency parsing, as the arguments of a predicate are identified through the token that heads the dependency relation subtree of the argument span. However, dependency-based SRL corpora are susceptible to the dissociated nucleus problem: when a subclause’s semantic and structural cores are two separate words, the dependency tree chooses the structural token as the head of the subtree, coercing the SRL annotation into making the same choice. This leads to undesirable consequences: when directly using the output of a dependency-based SRL method in downstream tasks it is useful to work with the token representing the semantic core of a subclause, not the structural core. In this paper, we carry out a linguistically-driven investigation on the dissociated nucleus problem in dependency-based SRL and propose a novel algorithm that aligns predicate-argument structures to the syntactic structures from Universal Dependencies to select the semantic core of an argument. Our analysis shows that dissociated nuclei appear more often than one might expect, and that our novel algorithm greatly increases the richness of the semantic information in dependency-based SRL. We release the software to reproduce our experiments at https://github.com/SapienzaNLP/semdepalign.
2024
Italian Conference on Computational Linguistics
natural language processing; semantic role labeling; multilingual NLP; semantics
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Exploring the Dissociated Nucleus Phenomenon in Semantic Role Labeling / Bonomo, Tommaso; Conia, Simone; Navigli, Roberto. - (2024). (Intervento presentato al convegno Italian Conference on Computational Linguistics tenutosi a Pisa; Italy).
File allegati a questo prodotto
File Dimensione Formato  
Bonomo_Exploring_2024.pdf

accesso aperto

Note: https://ceur-ws.org/Vol-3878/11_main_long.pdf
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 1.27 MB
Formato Adobe PDF
1.27 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1728118
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact