Catalogo dei prodotti della ricerca

We introduce EUREKA, an ensemble-based approach for performing automatic euphemism detection. We (1) identify and correct potentially mislabelled rows in the dataset, (2) curate an expanded corpus called EuphAug, (3) leverage model representations of Potentially Euphemistic Terms (PETs), and (4) explore using representations of semantically close sentences to aid in classification. Using our augmented dataset and kNN-based methods, EUREKA was able to achieve state-of-the-art results on the public leaderboard of the Euphemism Detection Shared Task, ranking first with a macro F1 score of 0.881.

EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation / Scott Keh, S., Bharadwaj, R.K., Liu, E., Tedeschi, S., Gangal, V., Navigli, R.. - (2022), pp. 111-117. (3rd Workshop on Figurative Language Processing (FLP) Abu Dhabi; United Arab Emirates ) [10.18653/v1/2022.flp-1.15].

EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation

Sedrick Scott Keh;Rohit K. Bharadwaj;Emmy Liu;Simone Tedeschi;Varun Gangal;Roberto Navigli

2022

Abstract

We introduce EUREKA, an ensemble-based approach for performing automatic euphemism detection. We (1) identify and correct potentially mislabelled rows in the dataset, (2) curate an expanded corpus called EuphAug, (3) leverage model representations of Potentially Euphemistic Terms (PETs), and (4) explore using representations of semantically close sentences to aid in classification. Using our augmented dataset and kNN-based methods, EUREKA was able to achieve state-of-the-art results on the public leaderboard of the Euphemism Detection Shared Task, ranking first with a macro F1 score of 0.881.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2022
			
	Nome convegno
	
				3rd Workshop on Figurative Language Processing (FLP)
			
	Parole chiave
	
				Natural Language Processing; Figurative Language; Euphemisms
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation / Scott Keh, S., Bharadwaj, R.K., Liu, E., Tedeschi, S., Gangal, V., Navigli, R.. - (2022), pp. 111-117. (3rd Workshop on Figurative Language Processing (FLP) Abu Dhabi; United Arab Emirates ) [10.18653/v1/2022.flp-1.15].
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

File	Dimensione	Formato
Keh_Eureka_2022.pdf accesso aperto Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Creative commons Dimensione 960.87 kB Formato Adobe PDF	960.87 kB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1671588

Citazioni

ND

8

2

social impact